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OUTLINE 


In this Outline we give a brief description of each item listed in the Contents. 
While the Contents and Index are quick ways to search, or learn the general layout 
of the book, the Outline gives more detail for the uninitiated. (The PDF version also 
allows use of the “Find” command in PDF readers.) 


dlc) C31: es Pere ee re ee ee ee a eee eee er ee es ee eee 23 
general remarks on style, organization, focus, content, use, differences from other 
texts, etc. 

pone: Teld Theory TORS. cies 0404 beset de Heda ie ee eeduddentatesatered 36 


recommended alternatives or supplements (but see Preface) 


Lice... PART ONE: SYMMETRY .....00.. 
Relativistic quantum mechanics and classical field theory. Poincaré group = special 
relativity. Enlarged spacetime symmetries: conformal and supersymmetry. Equations 
of motion and actions for particles and fields/wave functions. Internal symmetries: 
global (classifying particles), local (field interactions). 


I. Global 
Spacetime and internal symmetries. 
A. Coordinates 

spacetime symmetries 

hs WWion@el at Wy es ite aeens tenes uheae dle cekeee eke eis eeeaedsne denne 39 
Poisson bracket, Einstein summation convention, Galilean symmetry (in- 
troductory example) 

B.. POrmMiG@is 3 4 ves aeaetieen pice ded cotictnk < bse ai nek abet oa Been ee ee 46 
statistics, anticommutator; anticommuting variables, differentiation, in- 
tegration 

Oe Mie al@ebra. i. 2 sence een pened ete er eceuseniel elds eebine rh xe yen 51 
general structure of symmetries (including internal); Lie bracket, group, 
structure constants; brief summary of group theory 

A, RELSUIVALY rice ee tect don Laie dba eaitedotede behead aed 54 
Minkowski space, antiparticles, Lorentz and Poincaré symmetries, proper 
time, Mandelstam variables, lightcone bases 

Be Wigerenes GP. Ws cox eg hed eras veda ed  dd ee eed eed eee bees 65 
charge conjugation, parity, time reversal, in classical mechanics and field 
theory; Levi-Civita tensor 


6. Conformal .........00 0. ccc ccc cnc e een e ene eneeneeneea 68 


broken, but useful, enlargement of Poincaré; projective lightcone 


B. Indices 


easy way to group theory 


My WAAL ICOS O10 heats ih ds co ee hana Jodie sual tandice ca degse nels 73 
Hilbert-space notation 
2. Representations: cee cc crvetuae word ee ateresd kelea vet tuekere reins 76 


adjoint, Cartan metric, Dynkin index, Casimir, (pseudo)reality, direct 


sum and product 


- IDSCCT MINAS): 28 2.6 ists geet ein leaene Sy dsc aaah oteer doladenwacts 81 
with Levi-Civita tensors, Gaussian integrals; Pfaffian 

# lASSIe al OPO Sd: ete Oh cmd detrei beer dokes tae weeds eeu Sh ew eves 84 
and generalizations, via tensor methods 

 ENSO NOLALION 3053.24 dwn heel RRL Ra ee eee 86 


index notation, simplest bases for simplest representations 


C. Representations 


useful special cases 


Ly. More Coordinates: os 5 rsiaccs orc ms whew anced Rabe pans Sas Re ees wes 92 
Dirac gamma matrices as coordinates for orthogonal groups 
2; Coorditiate’ tensors. isosccc20 ea Se Bee Ds Nd a 94 


II. Spin 


formulations of coordinate transformations; differential forms 


. Youne tableaux ek ey ee ee hehe ee ees 99 


pictures for representations, their symmetries, sizes, direct products 


. Colorand: 1avOn oc ves neke eG vow be wR woah Mee one b Dee Gi Nels 101 


symmetries of particles of Standard Model and observed light hadrons 


 COVErINS OVONPS 2720s tet ec eae ta ees Sa Be a 107 


relating spinors and vectors 


Extension of spacetime symmetry to include spin. Field equations for field strengths 


of all spins. Most efficient methods for Lorentz indices in QuantumChromoDynamics 


or pure Yang-Mills. Supersymmetry relates bosons and fermions, also useful for QCD. 


A. Two components 


2x2 matrices describe the spacetime groups more easily (2<4) 


Ds: BHVECLOLS iarestictiie cane een he 18a YoG ed bitin deena nO An daels Bitten Gite oe a 110 
algebraic properties of 2x2 matrices, vectors as quaternions 
Do TROUALIONS 23 525 Ch oasd enact nen gh esinse oe Died axa ah athe tad pil 114 


in three (space) dimensions 
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Os “SO PINOLS 61 fu crdinned dads bienerecads qiteeebind eceacsenee ene decee 115 
basis for spinor notation 
Me: MICS ac 52a todo ond eee dads Sos bs eee bat ede dak eene ees 1g 


review of spin in simpler notation: many indices instead of bigger; tensor 


notation avoids Clebsch-Gordan-Wigner coefficients 


still 2x2 matrices, but four dimensions; dotted and undotted indices; 


antisymmetric tensors; matrix identities 


i TA secs stiaree eh pote wg ides as oma ch ian tor wae vee Aaa acs Rawk wi ones 126 
example in free field theory; 4-component identities 
« MOBPANEY (OUST s ocuvaiatacancingadstouraweeiceimed resend cess wee 128 


chiral symmetry, simpler with two-component spinor indices; more exam- 


ples; duality 


. Poincaré 


relativistic solutions 


1. Field equations so cccc conc cciachecedddeened esi dennewesoaa ee anics 131 
conformal group as unified way to all massless free equations 

Be WOM AMS die sics wine wane de ee bauedescs rece Rea amee Re ene Rabson eee: 134 
reproduction of familiar cases (Dirac and Maxwell equations) 

Be OM NG ig oscars. Bag Haden a eek ag sega cesieenume ten detorscotee aieenauds 13f 
proof; lightcone methods; transformations 

Os WAG Sans eee adetewsdteringiieneesce seen eedaareeeruneanareweee ie 141 


dimensional reduction; Sttickelberg formalism for vector in terms of mass- 
less vector + scalar 

Be. Poldy= WoUbhuyeen ai c2d.s.dscceeven teed etead eee edness 144 
an application, for arbitrary spin, from massless analog; transformation 
to nonrelativistic + corrections; minimal electromagnetic coupling to spin 
1/2; preparation for nonminimal coupling in chapter VUI for Lamb shift 

Gx TIWWISNOPS: .5405565506a2 scien ess cited nled ce ee tsteo ka eedas ed 148 
convenient and covariant method to solve massless equations; related to 
conformal invariance and self-duality; useful for QCD computations in 
chapter VI 

Dd Cy aoe csr eae ep ie he Sree oie een reine A ee ede eee 151 
via twistors; Penrose transform 

. Supersymmetry 

symmetry relating fermions to bosons, generalizing translations; general prop- 


erties, representations 


III. Local 


2 AIDE Bete dd nosed a acne aues bed bade bin Gian hace drcnenen ees 156 
definition of supersymmetry; positive energy automatic 
< PUPerCOOrC MALES abc cies get aks bua ia tenn viele acre ene raden 157 


superspace includes anticommuting coordinates; covariant derivatives 


generalize spacetime derivatives 


de MUPSPSTOUNSe .a eee orcl na aveke were dos oet eee eew ah isew oe tae Bee ee 160 
generalizing classical groups; supertrace, superdeterminant 

A. “Supercontormial e223 co Nis easy ss athaed lhe Pad eas ene Citak eae 163 
also broken but useful, enlargement of supersymmetry, as classical group 

fh. URPOREWISLOLG A: ice dic dst Sia Saati: ever doe ia Sateen o a xan Wee deca iy 164 


massless representations of supersymmetry 


Symmetries that act independently at each point in spacetime. Basis of fundamental 


forces. 


A. 


Actions 


for previous examples (spins 0, 1/2, 1) 


Die RrONnebal. guceteoe oooe staan weed Roo ania yes Kal ee as HES ey ESS BS 169 
action principle, variation, functional derivative, Lagrangians 

De BOPUMOUSy isis yee N helen ee hs a A pee en nad te 174 
quantizing anticommuting quantities; spin 

i SE BONIS de a8 wise t teat ck nn re a ne ited Gi te aha Rind Acer the aa odie meh 176 
actions in nonrelativistic field theory, Hamiltonian and Lagrangian den- 
sities 

A; ROIALIVILY 44 cI onc beae etal ete Ae eae Se Meta aos hath oe 180 


relativistic particles and fields, charge conjugation, good ultraviolet be- 


havior, general forces 


. Constrained systems............... 0.0. ees 186 


role of gauge invariance; first-order formalism; gauge fixing 


Particles 


relativistic classical mechanics; useful later in understanding Feynman dia- 


grams; simple example of local symmetry 


Te ITSO et ab feast edt ot dh ain eras in ae abt eats os ete ns a Ge te sh wd wh 191 
worldline metric, gauge invariance of actions 

Bis CANIS OS ioe Guia hveedewan teh ee eilete mae eb lake ane eae Uae UEdbowene 195 
gauge fixing, lightcone gauge 

Ba COU PUT ves ba 20) ace heen olga Prbe ath Gia. chad hotmase eee oars oo 197 


external fields 
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A MODSEr VATION, $5 ec veawe readers ken kaxdneastey bandetueeeatieeeeeua 198 
for classical particles; true vs. canonical energy 

Be. PA Crea lONs 35444-54451. eeeenin woh £6 -ebho iced eben Se 201 
and annihilation, for classical particle and antiparticle 

C. Yang-Mills 

self-coupling for spin 1; describes forces of Standard Model 

I INGHADGCHAR 2.24 S405 ded eck an cd aes peda eee ened aednd dena hada 204 
self-interactions; covariant derivatives, field strengths, Jacobi identities, 
action 

es MON CIN aa esc koe cit wees aigrs ple oak. ape Sunil oue eet pea tdenmeneen 208 
a unitary gauge; axial gauges; spin 1/2 

Se DTANG WAVES f< uc <Sperncesch ada ees re ekded ee ortasd -oceacueaeeeass 212 


simple exact solutions to interacting theory 


Ae BOUMCUAliEY oi.c:-5ediu seid oe eure Sie beck aden ee seustade Gen gheee ates 213 
and massive analog 

Hic WU WWISUONS sete ah os tae hee atn ns Meade ag aeg ered 217 
useful for self-duality; lightcone gauge for solving self-duality 

Gx, INS TAMCONS 6x 5 5.5652 ht wn bs PE eee Ohh ea sb Le eee 220 


nonperturbative self-dual solutions, via twistors; ’t Hooft ansatz; Chern- 


Simons form 


Ts, DIOTIM : ccexent catenaseuced ined aeewabetceeninen tat see eesy nee ees 224 
general instanton solution of Atiyah, Drinfel’d, Hitchin, and Manin 
Be INGNOD GIES oe wing nkee oe os Ketel cock Geto end at emer ened bee mees 226 


more nonperturbative self-dual solutions, but static 


IV. Mixed 
Global symmetries of interacting theories. Gauge symmetry coupled to lower spins. 
A. Hidden symmetry 
explicit and soft breaking, confinement 
1. Spontaneous breakdown................. 0.000 cece eee 22 


method; Goldstone theorem of massless scalars 


Be SIGMA WMOCEIS 2.5 oe ote ec se seige ene chanega vend egepesedwn eee rok es 234 
linear and nonlinear; low-energy theories of scalars 
Ds C Oset SPA ss cuted cadan deer enriewege pecepeunesdneesseceneds ase 237 


general construction, using gauge invariance, for sigma models 
dy. NOMGAl Sy MING Y 2 ec dee tua oars wend os dade oe Cee ea eas 240 
low-energy symmetry, quarks, pseudogoldstone boson, Partially Con- 


served Axial Current 
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SO RUCKE DOUG. dtc dah nate esate Sains bsed heehee bier beets deen eee 243 


scalars generate mass for vectors; free case 


PL TR Sete ere cae aes encode dats bocuhien Sask gn nea aes Enki uael erie ar ct trac 245 
same for interactions; Gervais-Neveu model; unitary gauge 
DilAtOnCOsinGloe ys jets eget eee eee i nd Pes 247 


cosmology with gravity replaced by Goldstone boson of scale invariance 


B. Standard model 
application to real world 


Te (Gh omod yramies 5c d52ti se) ied eae tees Pees 259 
strong interactions, using Yang-Mills; C and P 

2; Ecleetroweak.. bi cossucaas tyra eek eet woe oat ee Rete Gu ee eee 264 
unification of electromagnetic and weak interactions, using also Higgs 

Di VPAMMICS hao Osseo a hehe earl sO ieee aS ol he fn cea hy alot fein 267 


including all known fundamental leptons; Cabibbo-Kobayashi-Maskawa 


transformation; flavor-changing neutral currents 


. Grand Unified Theories..............0 00.0. c cc eens 269 


unification of all leptons and vector mesons 


C. Supersymmetry 


superfield theory, using superspace; useful for solving problems of perturba- 


tion resummation (chapter VIII) 


His PR et es hoe AO Sees eh ad es dite aM hs ede edit oa taeda aed Pan sans A halls 2k 2D 
simplest (“matter”) multiplet 

De INCUUOINS i. tea 08 led eis ie iti nr dek to deceatatigs aise ia neti Wiest rela ec otee hin Wahi 277 
to introduce interactions; component expansion, superfield equations 

3. Covariant derivatives ........ 0.0.6 e eens 280 
approach to gauge multiplet; vielbein, torsion; solution to Jacobi identities 

Ax PLCPOLCIIIAN 6 duce tecacinnn'en ne seeas Adele ese rede ueebinweu ded aawes 282 


fundamental superfield for constructing covariant derivatives; solution to 


constraints, chiral representation 


~~ MQAUIPECACTIONS Sok iced s Give saute aeons ee ee bed tea kG 284 
for gauge and matter multiplets; Fayet-Iliopoulos term 
AMOR o Sia a dda ei atten reat peas Bark ean add a ietn ad eee aera 287 


of supersymmetry; spurions 
PESTO A ate St Nite Ve ad forth cet ile Lata hc ct 0 De th Ms AE etd gdh Sic dea tee 289 
introduction to multiple supersymmetries; central charges 


ere PART TWO: QUANTA .........0.......... 
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Quantum aspects of field theory. Perturbation theory: expansions in loops, helicity, 


and internal symmetry. Although some have conjectured that nonperturbative ap- 


proaches might solve renormalization difficulties found in perturbation, all evidence 


indicates these problems worsen instead in complete theory. 


V. Quantization 


Quantization of classical theories by path integrals. Backgrounds fields instead of 


sources exclusively: All uses of Feynman diagrams involve either S-matrix or effective 


action, both of which require removal of external propagators, equivalent to replacing 


sources with fields. 
A. General 


various properties of quantum physics in general context, so these items need 


not be repeated in more specialized and complicated cases of field theory 


ue 


Pach, Weerals 2 2c.035105 peti sieee pe Pieass eed cate eine then 298 
Feynman’s alternative to Heisenberg and Schrodinger methods; relation 


to canonical quantization; unitarity, causality 


. Semiclassical expansion................0. 0c cece cence ees 303 


JWKB in path integral; free particle 


y PRO DAG AOlS. 62051. eda cena ed egee ease eed) oes ees 307 
Green functions; solution to Schrodinger equation via path integrals 

g MIMIMAONICOS ys.4 dann Senet ane ede 6o4 4 idee eedhedls Jad adds Ee od S4e Es 310 
scattering, most common use of field theory; unitarity, causality 

« Wick Pol alloc 607.5 cade neeaiunek eo es PES R RE es Bea aes EES 315 


imaginary time, to get Euclidean space, has important role in quantum 


mechanics 


B. Propagators 


relativistic quantum mechanics, free quantum field theory 


Me CNS oy cae Sala Grndse ve Renee Ben we ae ee 319 
Stiickelberg-Feynman propagator for spin 0; covariant gauge, lightcone 
gauge 

Bs PG ON NOS soci. coeec eee Senta swensnden ew buceareene gee ang dene ape ae a22 
features, relations to classical Green functions, inner product 

Dis CICTANIZ AU OTS oes og hap he ig aes Chap REL Ea EEE Rea SES 326 


other spins, nature of quantum corrections 
Wick POUR OM 10 o0.dh40265 oo dat dnddd whee eienarsedesldewedxtedues 329 


its relativistic use, in mechanics and field theory 
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C. S-matrix 
path integration of field theory produces Feynman diagrams/graphs; simple 
examples from scalar theories 
its PPathiinites tale. 2.7 pone durn da ie he boob nud sand eed aaa wks 334 
definition of initial/final states; generating functional of background fields; 


perturbative evaluation 


pictorial interpretation of perturbation theory; connected and one- 
particle-irreducible parts, effective action 

3. Semiclassical expansion................. 0. cece eee eens 344 
classical (tree) graphs give perturbative solution to classical field equa- 
tions 

A. Feynman: TUES pert vac ete nex kame reusente see eiue ious ee eler ha aen 349 
collection of simplified steps from action to graphs, in Wick-rotated (Eu- 
clidean) momentum space 

5. Semiclassical wnitarity..ccac: ¢icanke ee dais theese id we euaees 355 
properties of classical action needed for unitarity 

Gs Cutting: MINS sarscolweeuesea brew One Usebuwethks Reve iwes arenes 358 
diagrammatic translation of unitarity and causality 

fs COPOSS SECTIONS .ce Yow ie co. Sore Dae Pees Be eine eg BOLE a 361 


scattering probabilities; differential cross sections; cut propagators 


8. SING GATES sie he i ek ee es ee ee wh ea 366 
relation of Landau singularities in momenta to classical mechanics 
0: Group: CHOP 15/5 pordine ee era tans ogee kine evan eral en dmneie ren 368 


quark line rules for easily dealing with group theory in graphs 


VI. Quantum gauge theory 
Gauge fixing and more complicated vertices require additional methods. 
A. Becchi-Rouet-Stora-Tyutin 
easiest way to gauge fix, with fermionic symmetry relating unphysical degrees 
of freedom; unitarity clear by relating general gauges to unitary gauges; gen- 
eral discussion in framework of quantum physics and canonical quantization, 
so field theory can be addressed covariantly with path integrals 
Dy Tai Gonianic «:n.05 sy oe de aes ev dB Oe ea Caer deena ee 373 
canonical quantization, ghosts (unphysical states), cohomology (physical 
states /operators) 
2. Ma@ban iat) 727-2) a tiee ity suse eetea eaten etek tee ean 378 


relation to Hamiltonian approach, Nakanishi-Lautrup auxiliary fields 
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Be PAPI CCS. 2 one duce ven ou deed ice icserehetadsey adeeb ete Gad eod ease 381 
first-quantization 

7 A Oy (es eee ee eee ae nee eee en oe eer ee cme ee eee eee er ere 382 
Yang-Mills, unitary gauges 

B. Gauges 

different gauges for different uses (else BRST wouldn’t be necessary) 

We EIA sae crt es re si abe doe re eee eer en eae ote eens wee eeaad 386 
for particles in external fields 

aan Bis) aoe ee eee ae ene ee ae eee ere ae ee ee ere ee 389 
covariant class of gauges is simplest; Landau, Fermi-Feynman gauges 

Oe UU gees a be eh een oe a ied dea ies aod gone eae ee ead es 391 
Higgs requires modifications; unitary and renormalizable gauges 

Be WP alsa INGVCU 635.755 ss ees teee so a eb ew aoe okie) Oe 2 eee 393 
special Lorenz gauges that simplify interactions, complex gauges similar 
to lightcone; anti-Gervais-Neveu 

5. Super Gervais-Neveu ............0. 00. c cece nee eee eee 396 
supersymmetry has interesting new features 

Os PPACCCONE sok costes spe ee sync eee ee eee r caRneter Seeeeresseaceeneess 399 
general axial gauges; Wick rotation of lightcone, best gauge for trees; 
lightcone-based simplifications for covariant rules 

1s DUPCtePaACeCONe sy oncsscis evacuator see sed se dises sia cede 403 
supersymmetric rules also useful for nonsupersymmetric theories (like 
QCD) 

8. Background-field .............. 0.0.6 c ec e tenes 406 
class of gauges that simplifies BRST to ordinary gauge invariance for loops 

9:. INielsen-K alos y os 'c520cc52ccdownnacs ex ede es biG oe ep eee ee 412 
methods for more general gauges 

10. Super background-field .............00... 0000 ccc eee eee 415 


again new features for superfields; prepotentials only as potentials 


C. Scattering 


applications to S-matrices 


he WameMile. ..-cenctactetieteueneibeans £1by crea dageepeduasennedees 419 
explicit tree graphs made easy; 4-gluon and 5-gluon trees evaluated 

2 VOCUPSION V4 cede eves sndae ed cece dieeed weeds eee Pe dseesteses 423 
methods for generalizations to arbitrary number of external lines 

Ox POMMNIONS toads 42 nove deces peeressdiedeebeadentarekehehesntukeaees 426 


similar simplifications for high-energy QCD 
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Ai SNEASSES oe crest a te, h Bre nest aah Beh ncestsnes es Tew enti techs ola and a beable std heritage 429 


more complicated trees for massive theories; all 4-point tree amplitudes 


of QED, differential cross sections 


2 (UPECPETAPNS 5 inxs cw una Gar dune ciel eu swae bret tink abet edasees 435 


supersymmetric theories are simpler because of superspace; anticommut- 
ing integrals reduce to algebra of covariant derivatives; explicit locality of 


effective action in anticommuting coordinates implies nonrenormalization 


theorems 
VII. Loops 
General features of higher orders in perturbation theory due to momentum integra- 
tion. 
A. General 
properties and methods 
1. Dimensional renormalization .................0 000. c eee eee 440 
eliminating infinities; method (but not proof); dimensional regularization 
2. Momentum integration............ 000... ee 443 


general method for performing integrals, Beta and Gamma functions 


. Modified subtractions:. (io 5103 d6.4 25 hs ieee erie Sees 447 
schemes: minimal (MS), modified minimal (MS), momentum (MOM) 

« (Optical Theorems sins wired ier sy oe5dp eee ek he eee & 451 
unitarity applied to loops; decay rates 

i POWEr COUNEING so Pacnci es aler tied Ledel.cedmnteitate th accinegitedee aes 453 


how divergent, UV divergences, divergent terms, renormalizability, 
Furry’s theorem 


. Infrared divergences: 2 iscsi oe4cehes phe de wide Ck een ds oad oN 458 


brief introduction to long-range infinities; soft and colinear divergences 


B. Examples 


mostly one loop 


i ES rs fa 9 0 =t- pemrne eine enc eR Nt ee en OT en ene ner eee 462 
simplest examples: one external line, one or more loops, massless and 
massive 

2, Mlective porential.. ic ¢vicenswetasadd aru wey dee eden eves 465 
simplest application — low energy; first-quantization 

3. Dimensional transmutation................00. 000 cece eee 468 
most important loop effect; massless theories can get mass 

4. Massless propagators ............. 0000. e cece ete eee ens A70 


next simplest examples 
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Sh. OSONIZATION 6 66:<306 cach oyea ee edn oes Riddance Gate deaed heads 473 
fermion fields from boson fields in D=2 

G. Massive Ppropatalaras..des.dcr ee thee sd ieee eed badaw chee ae A78 
masses mean dimensional analysis is not as useful 

7. Renormalization group ............... 0000 c cece eens 482 
application of dimensional transmutation; running of couplings at high 
energy 

8. Overlapping divergences................. 0000 c cece eee ee eee 485 
two-loop complications (massless); renormalization of subdivergences 

C. Resummation 

how good perturbation is 

1. Improved perturbation ............... 00... c cece ee 492 
using renormalization group to resum 

2; (ReENOLIMAlONS 5 tig: pd ceen coke nde seve need hone ee A pees eRe bees 497 
how good renormalization is; instantons and IR and UV renormalons 


create ambiguities tantamount to nonrenormalizability 


improving resummation; ambiguities related to nonperturbative vacuum 
values of composite fields 

Mo LJ IN GxXPaNiGlOls 9.244 104ee01 ededsend nes baad Lexdewnered ened osedanene 504 
reorganization of resummation based on group theory; useful at finite 
orders of perturbation; related to string theory; Okubo-Zweig-lizuka rule; 
a solution to instanton ambiguity 


VIII. Gauge loops 
(Mostly) one-loop complications in gauge theories. 
A. Propagators 


QED and QCD 

De DOP ONOM hid ee a eieinbenaneetedwaeee nes hnawardweseeneeemseens 511 
correction to fermion propagator from gauge field 

Be. PWOUOM iS coe 26a ad doy oe eek boon va OEE Bate Lone eed eee eas 514 
correction to gauge propagator from matter 

Se, MMO 3s es a Lda datede ead ated Bae ae OSE ee es A 515 


correction to gauge propagator from self-interaction; total contribution to 
high-energy behavior 

A. Grand Unified Theories.................. 000... c cece ee 521 
3 gauge couplings running to 1 at high energy 

Bx MUPGIMaUl CE 4 uc o vent dooce dea k ede edo ei deieoree tials sedeaetiaus 524 
supergraphs at 1 loop 
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Ge SUPSre UG 51.644) cw eo eGSadd ade dln Chik en nde ee shane ek caches es 527 
finite N=1 supersymmetric theories as solution to renormalon problem 
if) DOAWINGET IMOGEL 4 yess knee wld eins be Slike Dilan ape plate, Su aco aca ans 531 


kinematic “bound” states at one loop (quantum Stiickelberg), axial 
anomaly 


. Low energy 


QED and anomaly effects 


ON ed VV PMc heheh tats echt thts Atti thts ee Mh Ns tka tad Aedes tad 537 
first-quantized approach to 1 loop at low-enregy 

Di PVA ANNAN oe 1 ited ae 8 aa le dea as etalon Gh he ah diay ween eee 540 
classical symmetry broken at one loop 

3.. Anomaly canceéllation :2:7 i253 okskeeenl aie eed ee eee ee tee ks 544 
constraints on electroweak models 

Bhi Gl Dhan dtu tan rv Lekeeutses opp eee wale wh GtieegNladesotdoen etl 546 
application of uncanceled anomaly 

He VELCCKe Acs wel awa wineve eles cnet ae ts ols Cole emeead wed acer taks 548 
one-loop 3-point function in QED 

6. Nonrelativistic JW KBis:< seers eee nari Guse eure dokedier ear eem x Ree 551 
nonrelativistic form of effective action useful for finding Lamb shift (in- 
cluding anomalous magnetic moment), using Foldy-Wouthuysen transfor- 
mation 

AU Ce kG el eae do ee a ee BE een a ers 554 


lattices for nonperturbative QCD; regulator; no gauge fixing; problems 


with fermions; Wilson loop, confinement; nonuniversality 


. High energy 


brief introduction to perturbative QCD 


1. Gontormal: anomaly: : xiiei< ac levedice dese vie te Pha nee Relates Diewiset 561 
relation to asymptotic freedom 

22 Oe = NACTOUS oi Cyan s as sear Deel eatin eee wy eae eee aah 564 
simplest QCD loop application; jets 

Dé PAP LON MOOG :.o:.s0s0ctve uehhaeeun e Oe nah ane sie rok Sis Rane athee nes 566 
factorization and evolution; deep inelastic and Drell-Yan scattering 

4. Maximal supersymmetry ...........0...0 0000s 573 
3- & 4-pt. amplitudes for N=4 supersymmetry 

Os. Hirst Qiantizalion oss: 434523 wee ne ea etait eevee 576 


making use of the worldline for loop calculations 
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... PART THREE: HIGHER SPIN ............... 


General spins. Spin 2 must be included in any complete theory of nature. Higher 


spins are observed experimentally for bound states, but may be required also as 


fundamental fields. 


IX. General relativity 


Treatment closely related to that applied to Yang-Mills, super Yang-Mills, and super- 


gravity. Based on methods that can be applied directly to spinors, and therefore to 


supergravity and superstrings. Somewhat new, but simplest, methods of calculating 


curvatures for purposes of solving the classical field equations. 
A. Actions 


starting point for deriving field equations for gravity (and matter) 


ie 


(GOUPE INVAPIANCE fi i ie4 ends ede eed rieevedssneedecies Sea seads eds 587 
curved (spacetime) and flat (tangent space) indices; coordinate (space- 


time) and local Lorentz (tangent space) symmetries 


. Covariant derivatives ......... 00... n nee ene enees 592 


gauge fields: vierbein (coordinate symmetry) and Lorentz connection; 
generalization of unit vectors used as basis in curvilinear coordinates; 
basis for deriving field strengths (torsion, curvature), matter coupling; 


Killing vectors (symmetries of solutions) 


De CIOMCUMOIS 6 ocx crenek seven dae ore Rhonda keh cken eawenweseveus 598 
gauges, constraints; Weyl tensor, Ricci tensor and scalar 

A> IOST ATION. 4; 4-desd bie edederdeelGedeeveduevevage tatuew ease an ees 601 
measure, invariance, densities 

Ds MP AV NYS cn ere on a edhd chad means tdadieeehelead oak aeaeaedee ck 605 
pure gravity, field equations 

G.. EMerey-MOMeNtUIMN: 2 ocicedced lence ce echo ddde en weeetewedeeedal ees 609 
matter coupling; gravitational energy-momentum 

Vs NWVOVU SCAG feels s2 isc erescstesopehy dni nebened Sha ckeaeed ii bad ee 611 


used later for cosmological and spherically symmetric solutions, gauge 
fixing, field redefinitions, studying conformal properties, generalization to 


supergravity and strings 


B. Gauges 
coordinate and other choices 
is TiQten es tyon2de4 oven ein ose ood Ae Ee ots bade d Hed Sed dda o Vases gee 620 
globally Lorentz covariant gauges, de Donder gauge; perturbation, BRST 
Bs MAOOCMESICS s Se duck ctx dina Meme aaetos < aun R xaath bieed Khare age eke, cea 622 


straight lines as solutions for particle equations of motion; dust 
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De IAL /e Aad toate faethe Caden tala kta Ae Genes eine a means aT es 625 
simplest unitary gauges; lightcone, Gaussian normal coordinates 
Ay TRACI Ab ised eects aks ins ee ihciel 24 baw ets Geen eute Sher ie pad oy 629 


gauges for external fields, Riemann normal coordinates; local inertial 


frame, parallel transport 


¢. WOES TSBGAle ae pater fica si ue walt wbinal Se pie ae aeea Sele wel asa tee 633 


use of Weyl scale invariance to simplify gauges; dilaton; string gauge 


C. Curved spaces 


solutions 

i OC eC BTA Wa. 2h Sant Scie eet eretin aaa et ae anade slp ted alma aa 638 
simplest solutions, waves; lightcone gauge for self-duality 

Do NIG SIE GOR as. for eter Be acter Mod Shade ded eobiueginead ea deaeetle Meee 640 
cosmological term and its vacuum, spaces of constant curvature 

Bs. COSIMOIOG Ys a0 cto es one ee A oe Rae ee Rowe oR DA 642 
the universe, Big Bang 

Aly RCO CSINVEG bose8 brits ite Ae tacts a Mote slot tgPe SOa ae anal ind vol Nha gs 645 
cosmological measurements: Hubble constant, deceleration parameter 

De CH WabZ SOME ty cola on awe tude ok one woud retbmeas pind eles 646 


spherical symmetry; applications of general methods for solving field equa- 


tions; electromagnetism 


2 periment sss sv cipieds ie ea Xk eead yea No Reale ae ey 654 


classic experimental tests: gravitational redshift, geodesics 
aCe GOS nc the Bhar oath ar ad nur cats maabaug coe x .ehpepinadenitse Geer daals 660 
extrapolation of spherically symmetric solutions (Kruskal-Szekeres coor- 


dinates); gravitational collapse, event horizon, physical singularity 


X. Supergravity 


Graviton and spin-3/2 particle (gravitino) from supersymmetry; local supersymmetry. 


A. Superspace 


simplest (yet complicated) method for general applications, especially quan- 


tum 


1. 


Covatiant derivatives: ois eek Pe Ss dae Se 664 
general starting point for gauge theories; R gauge symmetry; constraints, 


solution, prepotentials 


a HIGId -Strene Lbs’. Go.c e124 Devens ei eeenns tenes eevaepeteaes oles Gere 669 
generalization of curvatures; solution of Bianchi (Jacobi) identities 
~/ MOOMPENSAlLOrses Gey Mileage eile ceean buted acid keke ipkad aces 672 


more than one generalization of dilaton; minimal coupling 


A. 
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CAG SAI CS a. conecue eve dna teed iaus ads wnebtecedeedueveneeneedhed 675 
supersymmetric generalization of Weyl scale; transformations of field 


strengths, compensators 


B. Actions 


generalization from global supersymmetry, using superspace, components, 


and compensators 


Ws EPSP AI IG, senetsens tacacen bb edn beneeseupoeweneheceh eawenaha ca6 be 681 
different measures, supergravity action, matter, first-order action 

2. MGCLOPIAG beck erie cies dweeeren diedweanesne ei daaligeceeeceredned 684 
alternative method of integration, geared for components 

3. Component transformations..................0 000. 687 


derivation from superspace 
Component approach................ 00. ccc ened 689 
starting directly from components can be simpler for pure supergravity; 


second-order vs. first-order formulations 


ac, MOVIN coca ec ce he-ceet.a area eden ae na memeenne ries eeetetaeeenees 692 
relations of different formulations, old minimal vs. new minimal 
oe UDG PIIG GS 52 wabuwackbes epiayeeuu averse cnandadeeeRerebaenwateres 695 


mass to the gravitino; analysis using compensators 
MNOS coset cepa ter eee 2 Oce hee sei avern dis aa Wardens, a’sik pie aa doe- aa Rein Sate eke s 698 
simple models with naturally vanishing cosmological constant; sigma- 


model symmetries 


C. Higher dimensions 


useful for superstring and other unifications; extended supergravity 


ie. irae SpinOrs sid Se ato eee he A eee ees 701 
spinors for general orthogonal groups; metrics; also useful for GUTs 

2. Wick TOCAbION, ¢ oc. cded ene nadsceedeabeepucas stated eid stadaeoeaes 704 
generalization to arbitrary signature; sigma matrices, Majorana spinors 

Bee Gols) ag 0) (1) a re ee eee eee eee ie eee eee oe 708 


all types of fields that can appear in supergravity; restrictions on number 


of dimensions 


4, SUPersy MIMIC Y i4054014 0eSiehen dd dene eeeicedeaee ee ee reer 709 
supersymmetry in general dimensions; extended supersymmetry 

Bie HCONICS icc 0d5-oaeeadd dered es sad kkk wees Sed oeRd ta dicedaeed dadade® ville 
examples of supersymmetric and supergravity theories 

6. Reduevion tO D4 ba 5..c.cdscves dd eaderna ss eianeee ens eres eewegns Lo 


D=4 theories from higher dimensions; extended supergravities; S-duality 
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XI. Strings 
Approach to studying most important yet least understood property of QCD: con- 


finement. Other proposed methods have achieved explicit results for only low hadron 


energy. String theory is also useful for field theory in general. 


A. Generalities 


known string theories not suitable for describing hadrons quantitatively, but 


useful models of observed properties; qualitative features of general string 


theories, using dilaton and closed = open ® open 


De, HREGOO PNGCOP 22h Se week cad d uaa d star ee a etedmeae 724 
observed high-energy behavior of hadrons; bound states, s-t duality 

Be VOD OOS 2 24 tsvatits rie Odea, haadealied tee, dived ay ade were eee eases 728 
loop simplifications from geometry; closed from open 

3.. Classical mechanitss sacs oee iets ie as eee ee 733 


action, string tension, Virasoro constraints, boundary conditions 


reality and group properties; twisting; generalizations for massless part 
of theory; supergravity theories appearing in superstrings; string types: 
heterotic, Types I and II 


se ICCTA Vos epee aaa i ocak can erat ee es earn nei ge Catnad ela eeaes Safes 740 


strings unify massless antisymmetric tensors with gravity; transforma- 
tions, O(D,D) 


WAAC OT 22 oe cas te ered eles ees ears Bie a ea se halle ue eal sts es 742 
how it appears in strings and superstrings; constraints on backgrounds; 
S-duality 

TAR ICOS saree tein sae elas wokinele sain wulnoe ates Prelnie era a ae TAT 


discretization of string worldsheet into sum of Feynman diagrams; alter- 


native lattice theories relevant to string theory of hadrons 


B. Quantization 


calculational methods; spectrum; tree graphs 


We RIC OS 1 ad od cel emres Beemer ee seh Parsee need Realwanens Nelgea wat ead 756 
fixing 2D coordinates; conformal and lightcone gauges; open vs. closed 
strings 

2... Quaritum mechanics . 22 ¢. e600 pide beeen obec saree cous wi 761 


mode expansion; spectrum; ghosts 


oR OMEVINIUT AC ONS PO US atl a eds tall aati arcs cana, Uae acai dace 766 
commutators from propagators 
. Conformal transformations............. 000.0066 769 


consequences of conformal invariance of the worldsheet; vertex operators 
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lee IAD op hereaee di cede k ok eee penne) eae ea greater ndsocedteds vere: 
bosonization relates physical fermions and bosons 

6) TGCS i weieiadear ehhh ace hi iiw des ee eremee eee eres 778 
interactions, path integral; Regge behavior, but not parton behavior 

at MG MOSUS acta d-o0.ae bane oe anes day od peed 6 oon en eaan een ee tee 785 
ghosts manifest conformal invariance 

C. Loops 

quantum field theory corrections via first quantization 

i (Partition: MunCtION 4. 6s peeve eana thin doe bas ea ean eee eee Ls 791 
general setup 

2. Jacobi Theta Sanction oi ed ciaiiccaees ccna edeeeraskg deans be en 794 
functional properties for analyzing amplitudes 

Se ASCH -TNMINCHON, 4 4100.010046x56668eeaek ed enesenweeaedeaeeeaamdenes 797 
main part of path integral 

A MO aj conceal eye iaeat dade ¥eedidadlse ga eelee ergata 801 
singularities, including generation of closed strings 

Bt STS 3 Bidder Sa bee Gatton reac @ esa enna wee eae eee Bares eaten toes 806 
singularities, modular invariance 

Gs BUPeh 2 hi css deeb en) pal eSes vee ete Aedes pawn he eats 5 cee tes ea es 810 
supersymmetry, cancellation of divergences 

Ds: PRTMONT ANOS 935 sre Netas ace ep RAG BIER ee ebb eRe RR eS 814 


avoidance of potential problems with 10D supergravity 


XII. Mechanics 


General derivation of free actions for any gauge theory, based on adding equal numbers 


of commuting and anticommuting ghost dimensions. Usual ghost fields appear as 


components of gauge fields in anticommuting directions, as do necessary auxiliary 


fields like determinant of metric tensor in gravity. 
A. OSp(1,1|2) 
enlarged group of BRST, applied to first-quantization 


Wee ONG 2k ce dae de od we ead oe eh ee eee ee ee eka sede eed SEY 819 
BRST based on lightcone formulation of Poincaré group 

Bee, NS Erde eat on een nag eee eee einem esas Fe ariene nes eee 822 
add extra dimensions; nonminimal terms 

Bie de BION aie. s 6:3. onea. e.g aeerdteare ihn aaoee eo Aba ewe he sb pees aes ee 826 
for general spin 

A. SMDOM eats ciaiedarigirusededas tasks toes ied Roewebecank 827 


slight generalization for half-integer spin 
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Har eA APNE 5 04t.a, atl ited O44 see Serle Ped cme ue tee aasbiewte ene baa Melek ol 829 

specialization to usual known results: massless spins 0, s, 1, 3, 2 
B. IGL(1) 

subalgebra is simpler and sufficient; gauge fixing is automatic 

ls AGED uo cn od tt ete cot ees ee te dace eae an ae ees 834 
restriction from OSp(1,1|2) 

2. TACT PPOGWCL ge nui! hha seinen eet aes tee sete ee tale are See aaa tines alg 835 
modified by restriction 

Si AO RIONL 2.052.435 oad pe eee Rete ae PALS Wench ae aeede nae 837 
simpler form, but extra fields 

Al SOUL ION oi sea ore reg cot coh ie we ce aes ce cs ei he ale ay cease le Secs 2 840 
of cohomology; proof of equivalence to lightcone (unitarity) 

Dia OO PINIOES 22 ese cine Seen GAL Ai nek oe ce A bed saat a ies oad tant 843 
modified action; cohomology 

Gs IMTASSOS tp icy esha ese Bi eo ee Pe Pa ee eee 844 
by dimensional reduction; examples: spins 5; 1 

t. wackeround Heldss 4.4 idsewakcieis fo ane ia wens eet ealas 845 
as generalization of BRST operator; vertex operators in Yang-Mills 

Be OUTINGS ec ded ot ene oni tid ten Aetna egy eae hmtadeant cadedahdeandas a dats 847 


as special case; ghost structure from OSp; dilaton vs. physical scalar; 
heterotic string; vertex operators 

9: Relation‘ toe OSp Ul) 2) scents tae ee a ee 852 
proof of equivalence 

C. Gauge fixing 

Fermi-Feynman gauge is automatic 

EP Antibtackehrd closet evita Cease iteki a chee tuobrluy eter used 855 
antifields and antibracket appear naturally from anticommuting coordi- 
nate, first-quantized ghost of Klein-Gordon equation 


Be i EVA hae aot we latamae go rele caadeis che weve Sac u vos teers 858 
equivalence to Zinn-Justin-Batalin-Vilkovisky method 
Bo TOW. ant ee eins pene Kher he atietatas hous odie eine Gin gia cele 862 


relation to field theory BRST 


PIPER WIAD aes 32 Dh td chatting hitched ah Stadt cette certs ok ate edits eke hele 866 
Following the body of the text (and preceding the Index) is the AfterMath, containing 


conventions and some of the more important equations. 


Scientific method 


Although there are many fine textbooks on quantum field theory, they all have 
various shortcomings. /nstinct is claimed as a basis for most discussions of quantum 
field theory, though clearly this topic is too recent to affect evolution. Their subjectiv- 
ity more accurately identifies this as fashion: (1) The old-fashioned approach justifies 
itself with the instinct of intuition. However, anyone who remembers when they first 
learned quantum mechanics or special relativity knows they are counter-intuitive; 
quantum field theory is the synthesis of those two topics. Thus, the intuition in this 
case is probably just habit: Such an approach is actually historical or traditional, 
recounting the chronological development of the subject. Generally the first half (or 
volume) is devoted to quantum electrodynamics, treated in the way it was viewed in 
the 1950’s, while the second half tells the story of quantum chromodynamics, as it 
was understood in the 1970’s. Such a “dualistic” approach is necessarily redundant, 
e.g., using canonical quantization for QED but path-integral quantization for QCD, 
contrary to scientific principles, which advocate applying the same “unified” methods 
to all theories. While some teachers may feel more comfortable by beginning a topic 
the way they first learned it, students may wonder why the course didn’t begin with 
the approach that they will wind up using in the end. Topics that are unfamiliar 
to the author’s intuition are often labeled as “formal” (lacking substance) or even 
“mathematical” (devoid of physics). Recent topics are usually treated there as ad- 
vanced: The opposite is often true, since explanations simplify with time, as the topic 
is better understood. On the positive side, this approach generally presents topics 
with better experimental verification. 

(2) In contrast, the fashionable approach is described as being based on the in- 
stinct of beauty. But this subjective beauty of art is not the instinctive beauty of 
nature, and in science it is merely a consolation. Treatments based on this approach 
are usually found in review articles rather than textbooks, due to the shorter life ex- 
pectancy of the latest fashion. On the other hand, this approach has more imagination 
than the traditional one, and attempts to capture the future of the subject. 

A related issue in the treatment of field theory is the relative importance of con- 
cepts vs. calculations: (1) Some texts emphasize the concepts, including those which 
have not proven of practical value, but were considered motivational historically (in 
the traditional approach) or currently (in the artistic approach). However, many ap- 
proaches that were once considered at the forefront of research have faded into oblivion 
not because they were proven wrong by experimental evidence or lacked conceptual 
attractiveness, but because they were too complex for calculation, or so vague they 
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lacked predicitive ability. Some methods claimed total generality, which they used to 
prove theorems (though sometimes without examples); but ultimately the only useful 
proofs of theorems are by construction. Often a dualistic, two-volume approach is 
again advocated (and frequently the author writes only one of the two volumes): Like 
the traditional approach of QED volume + QCD volume, some prefer concept volume 
+ calculation volume. Generally, this means that gauge theory S-matrix calculations 
are omitted from the conceptual field theory course, and left for a “particle physics” 
course, or perhaps an “advanced field theory” course. Unfortunately, the particle 
physics course will find the specialized techniques of gauge theory too technical to 
cover, while the advanced field theory course will frighten away many students by its 
title alone. 

(2) On the other hand, some authors express a desire to introduce Feynman graphs 
as quickly as possible: This suggests a lack of appreciation of field theory outside of 
diagrammatics. Many essential aspects of field theory (such as symmetry breaking 
and the Higgs effect) can be seen only from the action, and its analysis also leads to 
better methods of applying perturbation theory than those obtained from a fixed set 
of rules. Also, functional equations are often simpler than pictorial ones, especially 
when they are nonlinear in the fields. The result of over-emphasizing the calculations 
is a cookbook, of the kind familiar from some lower-division undergraduate courses 
intended for physics majors but designed for engineers. 

The best explanation of a theory is the one that fits the principles of scientific 
method: simplicity, generality, and experimental verification. In this text we thus 
take a more economical or pragmatic approach, with methods based on efficiency 
and power. Unattractiveness or counter-intuitiveness of such methods become ad- 
vantages, because they force one to accept new and better ways of thinking about 
the subject: The efficiency of the method directs one to the underlying idea. For 
example, although some consider Einstein’s original explanation of special relativity 
in terms of relativistic trains and Lorentz transformations with square roots as be- 
ing more physical, the concept of Minkowski space gave a much simpler explanation 
and deeper understanding that proved more useful and led to generalization. Many 
theories have “miraculous cancellations” when traditional methods are used, which 
led to new methods (background field gauge, supergraphs, spacecone, etc.) that not 
only incorporate the cancellations automatically (so that the “zeros” need not be 
calculated), but are built on the principles that explain them. We place an emphasis 
on such new concepts, as well as the calculational methods that allow them to be 
compared with nature. It is important not to neglect one for the sake of the other, 


artificial and misleading to try to separate them. 
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As aresult, many of our explanations of the standard topics are new to textbooks, 
and some are completely new. For example: 

(1) We derive the Foldy-Wouthuysen transformation by dimensional reduction from 
an analogous one for the massless case (subsections IIB3,5). 

(2) Cosmology is discussed with just the dilaton instead of full general relativity 
(subsection IVA7). With only some minor fudges, this is sufficient to fit the 
post-inflation universe to observations. 

(3) We derive the Feynman rules in terms of background fields rather than sources 
(subsection VC1); this avoids the need for amputation of external lines for S- 
matrices or effective actions, and is more useful for background-field gauges. 

(4) We obtain the nonrelativistic QED effective action, used in modern treatments 
of the Lamb shift (because it makes perturbation easier than the older Bethe- 
Salpeter methods), by field redefinition of the relativistic effective action (sub- 
section VIIIB6), rather than fitting parameters by comparing Feynman diagrams 
from the relativistic and nonrelativistic actions. (In general, manipulations in the 
action are easier than in diagrams.) 

(5) We present two somewhat new methods for solving for the covariant derivatives 
and curvature in general relativity that are slightly easier than all previous meth- 
ods (subsections [IXA2,A7,C5). 

There are also some completely new topics, like: 

(1) the anti-Gervais-Neveu gauge, where spin in U(N) Yang-Mills is treated in al- 
most the same way as internal symmetry — with Chan-Paton factors (subsection 
VIB4); 

(2) the superspacecone gauge, the simplest gauge for QCD (subsection VIB7); and 

(3) anew “(almost-)first-order” superspace action for supergravity, analogous to the 
one for super Yang-Mills (subsection XB1). 

We try to give the simplest possible calculational tools, not only for the above 
reasons, but also so group theory (internal and spacetime) and integrals can be per- 
formed with the least effort and memory. Some traditionalists may claim that the old 
methods are easy enough, but their arguments are less convincing when the order of 
perturbation is increased. Even computer calculations are more efficient when left as 
a last resort; and you can’t see what’s going on when the computer’s doing the calcu- 
lating, so you don’t gain any new understanding. We give examples of (and exercises 
on) these methods, but not exhaustively. We also include more recent topics (or those 
more recently appreciated in the particle physics community) that might be deemed 
non-introductory, but are commonly used, and are simple and important enough to 


include at the earliest level. For example, the related topics of (unitary) lightcone 
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gauge, twistors, and spinor helicity are absent from all field theory texts, and as a 
result no such text performs the calculation of as basic a diagram as the 4-gluon 
tree amplitude. Another missing topic is the relation of QCD to strings through the 
random worldsheet lattice and large-color (1/N) expansion, which is the only known 
method that might quantitatively describe its high-energy nonperturbative behavior 
(bound states of arbitrarily large mass). 

This text is meant to cover all the field theory every high energy theorist should 
know, but not all that any particular theorist might need to know. It is not meant as 
an introduction to research, but as a preliminary to such courses: We try to fill in the 
cracks that often lie between standard field theory courses and advanced specialized 
courses. For example, we have some discussion of string theory, but it is more oriented 
toward the strong interactions, where it has some experimental justification, rather 
than quantum gravity and unification, where its usefulness is still under investigation. 
We do not mention statistical mechanics, although many of the field theory methods 
we discuss are useful there. Also, we do not discuss any experimental results in detail; 
phenomenology and analysis of experiments deserve their own text. We give and apply 
the methods of calculation and discuss the qualitative features of the results, but do 
not make a numerical comparison to nature: We concentrate more on the “forest” 
than the “trees”. 

Unfortunately, our discussions of the (somewhat related) topics of infrared-diver- 
gence cancellation, Lamb shift, and the parton model are sketchy, due to our inability 
to give fully satisfying treatments — but maybe in a later edition? 

Unlike all previous texts on quantum field theory, this one is available for free 
over the Internet (as usual, from arXiv.org and its mirrors), and may be periodically 
updated. Errata, additions, and other changes will be posted on my web page at 
http: //insti.physics.sunysb.edu/“siegel/plan.html until enough are accumulated for a 
new edition. Electronic distribution has many advantages: 

e It’s free. 

e It’s available quickly and easily. You can download it from the arXive.org or its 
mirrors, just like preprints, without a trip to the library (where it may be checked 
out) or bookstore or waiting for an order from the publisher. (If your connection 
is slow, download overnight.) And it won’t go “out of print”. 

e Download it at work, home, etc. (or carry it on a CD), rather than carrying a 
book or printing multiple copies. 

e Get updates just as quickly, rather than printing yet again. 


e It has the usual Web links, so you can get the referenced papers just as easily. 
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e It has a separate “outline” window containing a table of contents on which you 
can click to take the main window to that item. 

e You can electronically search (do a “find” on) the text. 

e Easier to read on the computer screen (arbitrary magnification, etc.) 

e Save trees, ink, and space. 

e Theft is not a problem. 

e No wear or tear. 

e No paper cuts. 

e You can even add notes (far bigger than would fit in the margin) with various 
software programs. 


Highlights 


The preceding Table of Contents lists the three parts of the text: Symmetry, 
Quanta, and Higher Spin. Each part is divided into four chapters, each of which has 
three sections, divided further into subsections. Each section is followed by references 
to reviews and original papers. Exercises appear throughout the text, immediately 
following the items they test: This purposely disrupts the flow of the text, forcing 
the reader to stop and think about what he has just learned. These exercises are 
interesting in their own right, and not just examples or memory tests. This is not a 
crime for homeworks and exams, which at least by graduate school should be about 
more than just grades. 

This text also differs from any other in most of the following ways: 

(1) We place a greater emphasis on mechanics in introducing some of the more ele- 
mentary physical concepts of field theory: 

(a) Some basic ideas, such as antiparticles, can be more simply understood al- 

ready with classical mechanics. 

(b) Some interactions can also be treated through first-quantization: This is suf- 
ficient for evaluating certain tree and one-loop graphs as particles in external 
fields. Also, Schwinger parameters can be understood from first-quantization: 
They are useful for performing momentum integrals (reducing them to Gaus- 
sians), studying the high-energy behavior of Feynman graphs, and finding 
their singularities in a way that exposes their classical mechanics interpreta- 
tion. 

(c) Quantum mechanics is very similar to free classical field theory, by the usual 
“semiclassical” correspondence (“duality”) between particles (mechanics) and 
waves (fields). They use the same wave equations, since the mechanics Hamil- 
tonian or Becchi-Rouet-Stora-Tyutin operator is the kinetic operator of the 
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(3) We 


corresponding classical field theory, so the free theories are equivalent. In 
particular, (relativistic) quantum mechanical BRST provides a simple expla- 
nation of the off-shell degrees of freedom of general gauge theories, and in- 
troduces concepts useful in string theory. As in the nonrelativistic case, this 
treatment starts directly with quantum mechanics, rather than by (first-) 
quantization of a classical mechanical system. Since supersymmetry and 
strings are so important in present theoretical research, it is useful to have a 
text that includes the field theory concepts that are prerequisites to a course 
on these topics. (For the same reason, and because it can be treated so 
similarly to Yang-Mills, we also discuss general relativity.) 

also emphasize conformal invariance. Although a badly broken symmetry, 
fact that it is larger than Poincaré invariance makes it useful in many ways: 
General classical theories can be described most simply by first analyzing 
conformal theories, and then introducing mass scales by various techniques. 
This is particularly useful for the general analysis of free theories, for finding 
solutions in gravity theories, and for constructing actions for supergravity 
theories. 

Spontaneously broken conformal invariance produces the dilaton, which can 
be used in place of general relativity to describe cosmology. 

Quantum theories that are well-defined within perturbation theory are confor- 
mal (“scaling”) at high energies. (A possible exception is string theories, but 
the supposedly well understood string theories that are finite perturbatively 
have been discovered to be hard-to-quantize membranes in disguise nonper- 
turbatively.) This makes methods based on conformal invariance useful for 
finding classical solutions, as well as studying the high-energy behavior of the 
quantum theory, and simplifying the calculation of amplitudes. 

Theories whose conformal invariance is not (further) broken by quantum cor- 
rections avoid certain problems at the nonperturbative level. Thus conformal 
theories ultimately may be required for an unambiguous description of high- 
energy physics. 


make extensive use of two-component (chiral) spinors, which are ubiquitous 


in particle physics: 


(a) 


The method of twistors (more recently dubbed “spinor helicity”) greatly sim- 
plifies the Lorentz algebra in Feynman diagrams for massless (or high-energy) 
particles with spin, and it’s now a standard in QCD. (Twistors are also re- 
lated to conformal invariance and self-duality.) On the other hand, most texts 


still struggle with 4-component Dirac (rather than 2-component Weyl) spinor 
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notation, which requires gamma-matrix and Fierz identities, when discussing 
QCD calculations. 

Chirality and duality are important concepts in all the interactions: Two- 
component spinors were first found useful for weak interactions in the days 
of 4-fermion interactions. Chiral symmetry in strong interactions has been 
important since the early days of pion physics; the related topic of instantons 
(self-dual solutions) is simplified by two-component notation, and general 
self-dual solutions are expressed in terms of twistors. Duality is simplest in 
two-component spinor notation, even when applied to just the electromagnetic 
field. 

Supersymmetry still has no convincing experimental verification (at least not 
at the moment I’m typing this), but its theoretical properties promise to solve 
many of the fundamental problems of quantum field theory. (Although there 
is no experimental evidence for supersymmetry, there is also no experimental 
evidence for the Higgs boson. They are equally important for predictability 
in particle physics, although for one this is seen in perturbation theory, while 
for the other only when attempting to resum it.) It is an element of most 
of the proposed generalizations of the Standard Model. Chiral symmetry is 
built into supersymmetry, making two-component spinors unavoidable. 


(4) The topics are ordered in a more pedagogical manner: 


(a) 


(b) 


Abelian and nonabelian gauge theories are treated together using modern 
techniques. (Classical gravity is treated with the same methods. ) 

Classical Yang-Mills theory is discussed before any quantum field theory. This 
allows much of the physics, such as the Standard Model (which may appeal to 
a wider audience), of which Yang-Mills is an essential part, to be introduced 
earlier. In particular, symmetries and mass generation in the Standard Model 
appear already at the classical level, and can be seen more easily from the 
action (classically) or effective action (quantum) than from diagrams. 

Only the method of path integrals is used for second-quantization. Canonical 
quantization is more cumbersome and hides Lorentz invariance, as has been 
emphasized even by Feynman when he introduced his diagrams. We thus 
avoid such spurious concepts as the “Dirac sea”, which supposedly explains 
positrons while being totally inapplicable to bosons. However, for quantum 
physics of general systems or single particles, operator methods are more 
powerful than any type of first-quantization of a classical system, and path 
integrals are mainly of pedagogical interest. We therefore “review” quantum 
physics first, discussing various properties (path integrals, S-matrices, unitar- 
ity, BRST, etc.) in a general (but simpler) framework, so that these properties 
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need not be rederived for the special case of quantum field theory, for which 
path-integral methods are then sufficient as well as preferable. 


(5) Gauge fixing is discussed in a way more general and efficient than older methods: 


(a) 


(c) 


The best gauge for studying unitarity is the (unitary) lightcone gauge. This 
rarely appears in field theory (and gravity) texts, or is treated only half way, 
missing the important explicit elimination of all unphysical degrees of free- 
dom. 

Ghosts are introduced by BRST symmetry, which proves unitarity by showing 
equivalence of convenient and manifestly covariant gauges to the manifestly 
unitary lightcone gauge. It can be applied directly to the classical action, 
avoiding the explicit use of functional determinants of the older Faddeev- 
Popov method. It also allows direct introduction of more general gauges 
(again at the classical level) through the use of Nakanishi-Lautrup fields 
(which are omitted in older treatments of BRST), rather than the functional 
averaging over Landau gauges required by the Faddeev-Popov method. 

For nonabelian gauge theories the background field gauge is a must. It makes 
the effective action gauge invariant, so Slavnov-Taylor identities need not be 
applied to it. Beta functions can be found from just propagator corrections. 


(6) Dimensional regularization is used exclusively (with the exception of one-loop 


axial anomaly calculations): 


(a) 


(b) 


It is the only one that preserves all possible symmetries, as well as being the 
only one practical enough for higher-loop calculations. 

We also use it exclusively for infrared regularization, allowing all divergences 
to be regularized with a single regulator (in contrast, e.g., to the three regu- 
lators used for the standard treatment of Lamb shift). 

It is good not only for regularization, but renormalization (“dimensional 
renormalization” ). For example, the renormalization group is most simply de- 
scribed using dimensional regularization methods. More importantly, renor- 
malization itself is performed most simply by a minimal prescription implied 
by dimensional regularization. Unfortunately, many books, even among those 
that use dimensional regularization, apply more complicated renormalization 
procedures that require additional, finite renormalizations as prescribed by 
Slavnov-Taylor identities. This is a needless duplication of effort that ignores 
the manifest gauge invariance whose preservation led to the choice of dimen- 
sional regularization in the first place. By using dimensional renormalization, 
gauge theories are as easy to treat as scalar theories: BRST does not have to 
be applied to amplitudes explicitly, since the dimensional regularization and 


renormalization procedure preserves it. 
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(7) Perhaps the most fundamental omission in most field theory texts is the expansion 
of QCD in the inverse of the number of colors: 

(a) It provides a gauge-invariant organization of graphs into subsets, allowing 
simplifications of calculations at intermediate stages, and is commonly used 
in QCD today. 

(b) It is useful as a perturbation expansion, whose experimental basis is the 
Okubo-Zweig-lizuka rule. 

(c) At the nonperturbative level, it leads to a resummation of diagrams in a way 
that can be associated with strings, suggesting an explanation of confinement. 

(8) Our treatment of gravity is closely related to that applied to Yang-Mills theory, 
and differs from that of most texts on gravity: 

(a) We emphasize the action for deriving field equations for gravity (and matter), 
rather than treating it as an afterthought. 

(b) We make use of local (Weyl) scale invariance for cosmological and spherically 
symmetric solutions, gauge fixing, field redefinitions, and studying conformal 
properties. In particular, other texts neglect the (unphysical) dilaton, which 
is crucial in such treatments (especially for generalization to supergravity and 
strings). 

(c) While most gravity texts leave spinors till the end, and treat them briefly, 
our discussion of gravity is based on methods that can be applied directly to 
spinors, and therefore to supergravity and superstrings. 

(d) Our methods of calculating curvatures for purposes of solving the classical 
field equations are somewhat new, but probably the simplest, and are directly 


related to the simplest methods for super Yang-Mills theory and supergravity. 


Notes for instructors 


This text is intended for reference and as the basis for a course on relativistic 
quantum field theory for second-year graduate students. The first two parts were 
repeatedly used for a one-year course I taught at Stony Brook. (There is more there 
than can fit comfortably into one year, so I skipped some subsections, but my choice 
varied.) It also includes material I used for a one-semester relativity course, and for 
my third of a one-year string course, both of which I also gave several times here — 


I used most of the following: 
relativity: IA, B3, C2; ITA; IITA-C5; IVA7; VIB1; IX; XIA3, A5-6, B1-2 


strings: IIB1-2; VILA2, B5, C4; VIIIB2, 4-5; XI; XIIA2, B1-3, B8 
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The prerequisites (for the quantum field theory course) are the usual graduate 
courses in classical mechanics, classical electrodynamics, and quantum mechanics. 
For example, the student should be familiar with Hamiltonians and Lagrangians, 
Lorentz transformations for particles and electromagnetism, Green functions for wave 
equations, SU(2) and spin, and Hilbert space. Unfortunately, I find that many second- 
year graduate students (especially many who got their undergraduate training in the 
USA) still have only an undergraduate level of understanding of the prerequisite 
topics, lacking a working knowledge of action principles, commutators, creation and 
annihilation operators, etc. While most such topics are briefly reviewed here, they 
should be learned elsewhere. 

Generally students need to be prepared to begin research at the beginning of their 
third year. This means they have to begin preparation for research in the middle of 
their second year, so standard courses for high-energy theorists, such as quantum 
field theory (and maybe even string theory), should already be finished by then. This 
is rather difficult, considering that quantum field theory is usually considered a one- 
year course that follows one-year prerequisites. The best solution would be to improve 
undergraduate courses, making them less repetitive, so first-semester graduate courses 
could be eliminated. An easier fix would be to make graduate courses more efficient, 
or at least better coordinated and more modern. For example: 

(1) Sometimes relativistic quantum mechanics is taught in the second semester of 
quantum mechanics. If this were done consistently, it wouldn’t need to be treated 
in the quantum field theory course. 

(2) The useful parts of classical electrodynamics are covered in the first semester. (Do 
all physicists really need to learn wave guides?) This is especially true if methods 
for solving wave equations (special functions, radiation, etc.) are not covered 
twice, once in quantum mechanics and once in electromagentism. Furthermore, 
we now know (since the early 20th century) that electromagnetism is not the 
only useful classical field theory: Why not have a one-year course on classical 
field theory, covering not only electromagnetism, but also Yang-Mills and general 
relativity? 

(3) A lot of the important concepts in the Standard Model (especially the electroweak 
interactions) are essentially classical: spontaneous symmetry breaking, the Higgs 
effect, tree graphs, etc. They could be covered as a third semester of classical 
field theory. 

(4) Meanwhile, true quantum field theory (quantization, loops, etc.) could become a 


third semester of quantum theory, taken in parallel with the Standard Model. 
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(5) Much of string theory is mechanics, not field theory. A string theory course could 
begin in the first semester of the second year (classical and statistical mechanics 
having been covered in the first year). 


In summary, a curriculum for high-energy theorists could look something like... 


sem. Mechanics Classical fields Quantum 

1 Classical mechanics Actions & symmetries Quantum theory 

2 Statistical mechanics Yang-Mills & gravity Solving wave equations 
3 Strings Standard Model Quantum field theory 


followed by more advanced courses (e.g., more quantum field theory or strings). An 
alternative is to start quantum field theory in the second semester of the first year. 

Unfortunately, in most places students start quantum field theory in their second 
year, having had little relativistic quantum mechanics and no Yang-Mills, so those 
subjects will comprise the first semester of the “quantum” field theory course, while 
the true quantum field theory will wait till the second semester of that year. 

To fit these various scenarios, the ordering of the chapters is somewhat flexible: 
The “flow” is indicated by the following “3D” plot: 


classical = quantum 
lower spin - oe i ee ae 
ose 
Ne 
1 1X XI 
higher spin 
x XII 


where the 3 dimensions are spin (“7”), quantization (“h”), and statistics (“s”): The 
three independent flows are down the page, to the right, and into the page. (The third 
dimension has been represented as perpendicular to the page, with “higher spin” in 
smaller type to indicate perspective, for legibility.) To present these chapters in the 
1 dimension of time we have classified them as jhs, but other orderings are possible: 


jhs: ITIIIIIV V VI VII VII IX X XI XII 

jsh: III V VILILIV VI VII IX XI X XII 

Ajs: ITIUIULIVIX X V VI XI XI VU VU 

Asj: TIUIIXIV X V XI VI XI VU VOU 

sjh: IIIV VILIIX XIIIV VI VU X XI 

shj: IMILIX V XI VILILIV X VI XU VOI 
(However, the spinor notation of II is used for discussing instantons in II, so some 
rearrangement would be required, except in the jhs, hjs, and hsj cases.) For exam- 
ple, the first half of the course can cover all of the classical, and the second quantum, 
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dividing Part Three between them (hjs or hsj). Another alternative (jsh) is a one- 
semester course on quantum field theory, followed by a semester on the Standard 
Model, and finishing with supergravity and strings. Although some of these (espe- 
cially the first two) allow division of the course into one-semester courses, this should 
not be used as an excuse to treat such courses as complete: Any particle physics 
student who was content to sit through another entire year of quantum mechanics in 
graduate school should be prepared to take at least a year of field theory. 


Notes for students 


Field theory is a hard course. (If you don’t think so, name me a harder one at this 
level.) But you knew as an undergraduate that physics was a hard major. Students 
who plan to do research in field theory will find the topic challenging; those with less 
enthusiasm for the topic may find it overwhelming. The main difference between field 
theory and lower courses is that it is not set in stone: There is much more variation in 
style and content among field theory courses than, e.g., quantum mechanics courses, 
since quantum mechanics (to the extent taught in courses) was pretty much finished 
in the 1920’s, while field theory is still an active research topic, even though it has had 
many experimentally confirmed results since the 1940’s. As a result, a field theory 
course has the flavor of research: There is no set of mathematically rigorous rules to 
solve any problem. Answers are not final, and should be treated as questions: One 
should not be satisfied with the solution of a problem, but consider it as a first step 
toward generalization. The student should not expect to capture all the details of 
field theory the first time through, since many of them are not yet fully understood 
by people who work in the area. (It is far more likely that instead you will discover 
details that you missed in earlier courses.) And one reminder: The only reason for 
lectures (including seminars and conferences) is for the attendees to ask questions 
(and not just in private), and there are no stupid questions (except for the infamous 
“How many questions are on the exam?”). Only half of teaching is the responsibility 
of the instructor. 

Some students who have a good undergraduate background may want to begin 
graduate school taking field theory. That can be difficult, so you should be sure you 
have a good understanding of most of the following topics: 

(1) Classical mechanics: Hamiltonians, Lagrangians, actions; Lorentz transforma- 
tions; Poisson brackets 

(2) Classical electrodynamics: Lagrangian for electromagnetism; Lorentz transfor- 
mations for electromagnetic fields, 4-vector potential, 4-vector Lorentz force law; 


Green functions 
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(3) Quantum mechanics: coupling to electromagnetism; spin, SU(2), symmetries; 
Green functions for Schrodinger equation; Hilbert space, commutators, Heisen- 
berg and Schrodinger pictures; creation and annihilation operators, statistics 
(bosons and fermions); JWKB expansion 

It is not necessary to be familiar with all these topics, and most will be briefly 

reviewed, but if most of these topics are not familiar then there will not be enough 

time to catch up. A standard undergraduate education in these three courses is not 
enough. 
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Version 2 


This update contains no new topics or subsections, but many small changes 
(amounting to a 10% increase in size): corrections, improved explanations, exam- 
ples, (20% more) exercises, figures, references, cosmetics (including more color), and 
an expanded Outline and AfterMath. There are also a few small additions, such as a 
more fundamental explanation of causality and unitarity in quantum mechanics, and 
the use of Weyl scaling as a general method for spherical (as well as cosmological) 
solutions to Einstein’s equations. It now TeX’s with either ordinary TeX or pdftex. 
(PDF figures can be created from PS with ghostscript; also available at my web site.) 


September 19, 2002 


Version 3 


The size has increased another 10%. I added a new subsection on cosmology using 
just the dilaton instead of general relativity, one on 1-loop graphs in N=4 Yang-Mills, 
another on first-quantization in gauge theory, and a whole section on string loops. I 
fixed some EPS figures so they come out as bright as the PDF ones (and smaller files 
than before). Also, more figures, exercises, references, etc. (PDF can be created from 
EPS natively with Mac OS X now.) 


August 23, 2005 


Traditional, leaning toward concepts 
Canonically quantize QED and calculate, then introduce path integrals 


1 S. Weinberg, The quantum theory of fields, 3 v. (Cambridge University, 1995, 
1996, 2000) 609+489+419 pp.: 
First volume QED; second volume contains many interesting topics; third volume 
supersymmetry (but no 2-component spinors, and only 11 pg. on supergraphs, 
non-gauge). By one of the developers of the Standard Model. 

2 M. Kaku, Quantum field theory: a modern introduction (Oxford University, 1993) 
785 pp.: 
Includes introduction to supergravity and superstrings. 

3 C. Itzykson and J.-B. Zuber, Quantum field theory (McGraw-Hill, 1980) 705 pp. 
(but with lots of small print): 
Emphasis on QED. 

4 N.N. Bogoliubov and D.V. Shirkov, Introduction to the theory of quantized fields, 
3rd ed. (Wiley, 1980) 620 pp.: 
Ahead of its time (1st English ed. 1959): early treatments of path integrals, causal- 
ity, background fields, and renormalization of general field theories; but before 
Yang-Mills and Higgs. 


Traditional, leaning toward calculations 
Emphasis on Feynman diagrams 


5 M.E. Peskin and D.V. Schroeder, An introduction to quantum field theory 
(Perseus, 1995) 842 pp.: 
Comprehensive; style similar to Bjorken and Drell. 

6 B. de Wit and J. Smith, Field theory in particle physics, v. 1 (Elsevier Science, 
1986) 490 pp.: 
No Yang-Mills or Higgs (but wait till v. 2, due any day now...). 

7 A.l. Akhiezer and V.B. Berestetskii, Quantum electrodynamics (Wiley, 1965) 868 
pp-: 
Numerous examples of QED calculations. 

8 R.P. Feynman, Quantum electrodynamics: a lecture note and reprint volume 
(Perseus, 1961) 198 pp.: 
Original treatment of quantum field theory as we know it today, but from me- 
chanics; includes reprints of original articles (1949). 
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Modern, but somewhat specialized 
Basics, plus thorough treatment of an advanced topic 


9 J. Zinn-Justin, Quantum field theory and critical phenomena, 4th ed. (Clarendon, 

2002) 1074 pp.: 
First 1/2 is basic text, with interesting treatments of many topics, but no S-matrix 
examples or discussion of cross sections; second 1/2 is statistical mechanics. 

10 G. Sterman, An introduction to quantum field theory (Cambridge University, 
1993) 572 pp.: 
First 3/4 can be used as basic text, including S-matrix examples; last 1/4 has 
extensive treatment of perturbative QCD, emphasizing factorization. (Weak in- 
teractions are in an appendix.) 


Modern, but basic: few S-matrix examples 
Should be supplemented with a “QED/particle physics text” 


11 L.H. Ryder, Quantum field theory, 2nd ed. (Cambridge University, 1996) 487 pp.: 
Includes introduction to supersymmetry. 

12 D. Bailin and A. Love, Introduction to gauge field theory, 2nd ed. (Institute of 
Physics, 1993) 364 pp.: 
All the fundamentals. 

13 P. Ramond, Field theory: a modern primer, 2nd ed. (Perseus, 1989) 329 pp.: 
Short text on QCD: no weak interactions. 


Advanced topics 


For further reading; including brief reviews of some standard topics 


14 Theoretical Advanced Study Institute in Elementary Particle Physics (TASI) pro- 
ceedings, University of Colorado, Boulder, CO (World Scientific): 
Annual collection of summer school lectures on recent research topics. 

15 W. Siegel, Introduction to string field theory (World Scientific, 1988), hep- 
th/0107094, 244 pp.: 

Reviews lightcone, BRST, gravity, first-quantization, spinors, twistors, strings; 
besides, I like the author. 

16 S.J. Gates, Jr., M.T. Grisaru, M. Roéek, and W. Siegel, Superspace: or one 
thousand and one lessons in supersymmetry (Benjamin/Cummings, 1983), hep- 
th/0108200, 548 pp.: 

Covers supersymmetry, spinor notation, lightcone, Stiickelberg fields, gravity, 
Weyl scale, gauge fixing, background-field method, regularization, and anoma- 
lies; same author as previous, plus three other guys whose names sound familiar. 
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PART ONE: SYMMETRY 


The first four chapters present a one-semester course on “classical field theory’. 
Perhaps a more accurate description would be “everything you should know before 


learning quantum field theory”. 


One of the most important and fundamental principles in physics is symmetry. 
A symmetry is a transformation (a change of variables) under which the laws of 
nature do not change. It places strong restrictions on what kinds of objects can exist, 
and how they can interact. When dynamics are described by an action principle 
(Lagrangian, Hamiltonian, etc.), as required by quantum mechanics (but also useful 
classically), continuous symmetries are equivalent to conservation laws, which are the 
sole content of Newton’s laws. In particular, local (“gauge”) symmetries, which allow 
independent transformations at each coordinate point, are basic to all the fundamental 
interactions: All the fundamental forces are mediated by particles described by Yang- 


Mills theory and its generalizations. 


From a practical viewpoint, symmetry simplifies calculations by relating different 
solutions to equations of motion, and allowing these equations to be written more 


concisely by treating independent degrees of freedom as a single entity. 


Part One is basically a study of global and local symmetries: Classical dynamics 
represents only a certain limit of quantum dynamics, and not the one usually em- 
phasized, but most of the symmetries of classical physics survive quantization. The 
phenomenon of symmetry breaking, and the related mechanisms of mass generation, 
can also be seen at the classical level. In perturbative quantum field theory, classical 


field theory is simply the leading term in the perturbation expansion. 


Note that “global” (time-, and usually space-independent) symmetries can elim- 
inate a variable, but not its time derivative. For example, translation invariance 
allows us to fix (i.e., eliminate) the position of the center of mass of a system at some 
initial time, but not its time derivative, which is just the total momentum, whose 
conservation is a consequence of that same symmetry. A local symmetry, being time 
dependent, may allow the elimination of a variable at all times: The existence of this 


possibility depends on the dynamics, and will be discussed later. 


Of particular interest are ways in which symmetries can be made manifest. Fre- 
quently in the literature “manifest” is used vacuously; a “manifest symmetry” is an 
obvious one: If you know the group, the representation under consideration doesn’t 


need to be stated, but can be seen from just the notation. (In fact, one of the main 
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uses of index notation is just to manifest the symmetry.) Formulations where global 
and local symmetries are manifest simplify calculations and their results, as well as 
clarifying their meaning. 

One of the main uses of manifest symmetry is rarely needing to explicitly perform 
a specific symmetry transformation. For example, one might need to examine a rela- 
tivistic problem in different Lorentz frames. Rather than starting with a description 
of the problem in one frame, and then explicitly transforming to another, it is much 
simpler to start with a manifestly covariant description, make one choice of frame, 
then make another choice of frame. One then never uses the messy square roots of the 
familiar Lorentz contraction factors (although they may appear at the end from kine- 
matic constraints). A more extreme example is the corresponding situation for local 
symmetries, where such transformations are intractable in general, and one always 


starts with the manifestly covariant form. 
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In the first chapter we study symmetry in general, concentrating primarily on 
spacetime symmetries, but also discussing general properties that will have other 


applications in the following chapter. 


nid beetei hi Reodyavetate A. COORDINATES ........................ 


In this section we discuss the Poincaré (and conformal) group as coordinate trans- 
formations. This is the simplest way to represent it on the physical world. In later 


sections we find general representations by adding spin. 


1. Nonrelativity 


We begin by reviewing some general properties of symmetries, including as an 
example the symmetry group of nonrelativistic physics. Symmetries are the result of a 
redundant, but useful, description of a theory. (Note that here we refer to symmetries 
of a theory, not of a solution to the theory.) For example, translation invariance says 
that only differences in position are measurable, not absolute position: We can’t 


measure the position of the “origin”. There are three ways to deal with this: 


(1) Keep this invariance, and the corresponding redundant variables, which allows all 


particles to be treated equally. 


(2) Choose an origin; i.e., make a “choice of coordinates”. For example, place an 


object at the origin; i.e., choose the position of an object at a certain time to be the 
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origin. We can use translational invariance to fix the position of any one particle 
at a given time, but not the rest: The differences in position are “translationally 
invariant”. In this example, for N particles there are 3N coordinates describing 
the particles, but still only 3 translations: The particles interact in the same 


3-dimensional space. 


(3) Work only in terms of the differences of positions themselves as the variables, al- 
lowing a symmetric treatment of the particles in terms of translationally invariant 
variables: However, in this example this would require applying constraints on 
the variables, since there are 3N(N—1)/2 differences, of which only 3(N—1) are 


independent. 


Although the last choice is most physical, the first is usually most convenient: The 
use of redundant variables, together with symmetry, often gives a simpler description 
of a theory. We will find similar features later for “local” invariances: In general, the 
most convenient description of a theory is with the invariance; the invariance can then 
be fixed, or invariant combinations of variables used, appropriately for the particular 


application. 


Exercise IA1.1 
Consider a system of objects labeled by the index J, each object located at 
position x;. (For simplicity, we can consider one spatial dimension, or just 
ignore an index labeling the different directions.) Because of translational 
invariance 


vp =a t+ 6x 
where 6x is a constant independent of J, we are led to define new variables 
Oe ts Ot ees Oe 


invariant under the above symmetry. But these are not independent, satisfy- 
ing 
fry = LH; Lrg t+ tjK +£K1 = 0 

for all J, J, K. Start with x,;; as fundamental instead, and show that the 
solution of these constraints is always in terms of some derived variables x, 
as in our original definition. (Hint: What happens if we define 7; = 0?) 
The appearance of a new invariance upon solving constraints in terms of new 
variables is common in physics: e.g., the gauge invariance of the potential 


upon solving the source-free half of Maxwell’s equations. 


Another example is quantum mechanics, where the arbitrariness of the phase of 


the wave function can be considered a symmetry: Although quantum mechanics can 
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be reformulated in terms of phase-invariant probabilities, currents, or density matri- 
ces instead of wave functions, and this can be useful for some purposes of exposing 
physical properties, formulating and solving the Schrodinger equation is simpler in 
terms of the wave function. The same applies to “local” symmetries, where there 
is an independent symmetry at each point of space and time: For example, quarks 
and gluons have a local “color” symmetry, and are not (yet) observed independently 
in nature, but are simpler objects in terms of which to describe strong interactions 
than the observed hadrons (protons, neutrons, etc.), which are described by color- 
invariant products of quark/gluon wave functions, in the same way that probabilities 


are phase-invariant products of wave functions. 


(Note that in quantum mechanics there is a subtle distinction between observed 
and observer that can obscure this symmetry if the observer is not invariant under 
it. This can always be avoided by choosing to define the observer as invariant: For 
example, the detection apparatus can be included as part of the quantum mechanical 
system, while the observer can be defined as some “remote” recorder, who may be 
abstracted as even being translationally invariant. In practice we are less precise, and 
abstract even the detection apparatus to be invariant: For example, we describe the 
scattering of particles in terms of the coordinates of only the particles, and deal with 


the origin problem as above in terms of just those coordinates. ) 


In the Hamiltonian approach to mechanics, both symmetries and dynamics can 
be expressed conveniently in terms of a “bracket”: the Poisson bracket for classi- 
cal mechanics, the commutator for quantum mechanics. In this formulation, the 
fundamental variables (operators) are some set of coordinates and their canonically 
conjugate momenta, as functions of time. The (Heisenberg) operator approach to 
quantum mechanics then is related to classical mechanics by identifying the semiclas- 
sical limit of the commutator as the Poisson bracket: For any functions A and B of 


p and q, the quantum mechanical commutator is 


0A 0B OB 2A) O(n’) 


AR BAS |. = 
. & ag™ Op, dg™ 


where all terms are generated by re-ordering. (For example, if we define “normal 
ordering” in A and B by putting all q’s to the left of all p’s, then doing so in the 
products will lead to an automatic cancellation of the “classical” terms, with all the 
original p’s and q’s.) In other words, the true classical limit of AB — BA is zero, since 


classically functions commute; thus the semiclassical limit is defined by 


h—-0 


lim . AB—BA 
i ] 
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(which is really a derivative with respect to h). We therefore define the bracket for 


the two cases by 


—1 ( cage — op Us ) semiclassically 
[A, B] = Opm OG" ~— OPm OY 
AB-—BA quantum mechanically 


The semiclassical definition of the bracket then can be applied to classical physics 
(where it was originally discovered). Classically A and B are two arbitrary functions 
of the coordinates g and momenta p; in quantum mechanics they can be arbitrary 


“7” in the classical normalization so the two agree 


operators. We have included an 
in the semiclassical limit. We generally use (natural/Planck) units h = 1, so mass 
is measured as inverse length, etc. (In fact, proposals have been made to fix the 
value of h by definition, and then determine the value of the kilogram by exper- 
imental apparatus such as the “watt balance”, rather than relying on a cylinder 
somewhere in Paris.) When we do use an explicit h, it is a dimensionless parameter, 
and appears only for defining Jeffries- Wentzel-Kramers-Brillouin (JWKB) expansions 


or (semi)classical limits. 


Our indices may appear either as subscripts or superscripts, with preferences to 
be explained later: For nonrelativistic purposes we treat them the same. We also use 
the Einstein summation convention, that any repeated index in a product is summed 


over (“contracted”); usually we contract a superscript with a subscript: 


AR. = > A°B., 


m 


The definition of the bracket is equivalent to using 
[Pm q"] = —10;, 


(where 6” is the “Kronecker delta function”: 1 if m =n, 0 if m 4 n) together with 


the general properties of the bracket 
[A; 3) =—|B, Al; [A, B]' = -[At, Bt 
[[A, B], C] + [[B, C], A] + [[C, A], B] = 0 
[A, BC] = [A, BJC + BIA, C] 


The first set of identities exhibit the antisymmetry of the bracket; next are the “Ja- 
cobi identities”. In the last identity the ordering is important only in the quantum 


mechanical case: In general, the difference between classical and quantum mechanics 


A. COORDINATES 43 


comes from the fact that in the quantum case operator reordering after taking the 


commutator results in multiple commutators. 


Infinitesimal symmetry transformations are then written as 
6A = iG, A], A'=A+06A 


where G is the “generator” of the transformation. More explicitly, infinitesimal gen- 


erators will contain infinitesimal parameters: For example, for translations we have 
G=én, => és' =iG,2']=e, dp,=0 
where ¢’ are infinitesimal numbers. 

As we'll see later (subsection IA3), the bracket of any two generators of infinites- 
imal transformations is also an infinitesimal transformation. Thus, any symmetry 
group defines an algebra whose properties follow from the above general properties 
of the bracket. 

The most evident physical symmetries are those involving spacetime. For nonrel- 
ativistic particles, these symmetries form the “Galilean group”: For the free particle, 
those infinitesimal transformations are linear combinations of 


Pi 
2m’ 


M=m, R=p, Jig = Uppy =Vipj —Tjpi, L=H= V, = mz; — pit 


in terms of the position x’ (7 = 1,2,3), momenta p;, and (nonvanishing) mass m, 
where |7j] means to antisymmetrize in those indices, by summing over all permuta- 
tions (just two in this case), with plus signs for even permutations and minus for odd. 
(In three spatial dimensions, one often writes J; = SE ijkI jk to make J into a vector, 
where ¢€ is totally antisymmetric in its indices and €;23 = 1. This is a peculiarity of 
three dimensions, and will lose its utility once we consider relativity in four spacetime 
dimensions.) These transformations are the space translations (momentum) P, rota- 
tions (angular momentum — just orbital for the spinless case) J, time translations 
(energy) FE, and velocity transformations (“Galilean boosts”) V. (The mass M is not 


normally associated with a symmetry, and is not conserved relativistically.) 


Exercise [A1.2 
Let’s examine the Galilean group more closely. Using just the relations for 
[x, p| and [A, BC] (and the antisymmetry of the bracket): 


a Find the action on x; of each kind of infinitesimal Galilean transformation. 


b Show that the nonvanishing commutation relations for the generators are 


[Jigs Pal = t5ipPy, [Jigs Vel = ta Viq, [ig ST) = 1! 
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For more than one free particle, we introduce an m, x’, and p; for each particle (but 
the same t), and the generators are the sums over all particles of the above expressions. 
If the particles interact with each other the expression for H is modified, in such a 
way as to preserve the commutation relations. If the particles also interact with 
dynamical fields, field-dependent terms must be added to the generators. (External, 
nondynamical fields break the invariance. For example, a particle in a Coulomb 


potential is not translation invariant since the potential is centered about some point.) 


Exercise IA1.3 
Show that the Hamiltonian 


a= PE) a ote 2s 


preserves the algebra of exercise [A1.2 for the Galilean group, where the other 
generators are modified only by summing over the index “J” labeling each 
particle. (There are also implicit sums over the usual vector index “i”; U is 


a function of coordinate differences for each I and J.) 


The rotations (or at least their “orbital” parts) and space translations are exam- 
ples of coordinate transformations. In general, generators of coordinate transforma- 


tions are of the form 
G=X{(c)p, => 66(x) =i[G, | = 0d 


where 0; = 0/0x' and ¢(2) is a “scalar field” (or “spin-0 wave function”), a function 


of only the coordinates. 


In classical mechanics, or quantum mechanics in the Heisenberg picture, time 


development also can be expressed in terms of the Hamiltonian using the bracket: 


d O O 
—A=/—+71H,A| =—A+i/H,A 
dt Pan | a 
(The middle expression with the commutator of 0/0t makes sense only in the quantum 
case, and is not defined for the Poisson bracket.) Again, this general relation is 
equivalent to the special cases, which in the classical limit are Hamilton’s equations 
of motion: re ne 4 on 

qd ; Pm ; 

dt i 4 ? dt a »D Oq™ 
The Hamiltonian has no explicit time dependence in the absence of time-dependent 


nondyamical fields (external potentials whose time dependence is fixed by hand, 


A. COORDINATES 45 


rather than by introducing the fields and their conjugate variables into the Hamilto- 
nian). Consequently, time development is itself a symmetry: Time translations are 
generated by the Hamiltonian; the 0/Ot term in d/dt term can be dropped when 
acting on operators without explicit time dependence. 


Invariance of the theory under a symmetry means that the equations of motion 
are unchanged under the transformation: 


dA\' dA’ 
dt} — dt 
To apply our above translation of infinitesimal transformations into bracket language, 


we define 6(d/dt) by 
d d d 
s(4a) =5(4) 4 400 


In the quantum case we can write 


(a)-[lede | 


which follows from the Jacobi identity using B = iG and C = 0/ot+iH, and inserting 
A into the blank spaces of the commutators above. (The classical case can be treated 
similarly, except that the time derivatives are not written as brackets.) We then find 


that the generator of a symmetry transformation is conserved (constant), since 


0-5(5) = iF - [6 | =o Ee | 


Exercise [A1.4 


Show that the generators of the Galilean group are conserved: 
a Use the relation d/dt = 0/0t+7[H, | for the hamiltonian H of a free particle. 


b Solve the equations of motion for x(t) and p(t) in terms of initial conditions, 
and substitute into the expression for the generators to give an independent 
derivation of their time independence. 


Note that in the case where the Galilean symmetry persists for interacting mul- 
tiparticle systems, (total) mass is conserved. In particular, invariance under transla- 


tions and velocity transformations implies mass conservation. 


In the cases where time dependence is not involved, symmetries can be treated 
in almost exactly the same way either classically or quantum mechanically using the 
corresponding bracket (Poisson or commutator), by using the properties that they 
have in common. In particular, the fact that a symmetry generator G = \'"(xr)pm is 


conserved means that we can solve for a component of p in terms of the constant G, 
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and substitute the result into the remaining equations of motion, and that the con- 
jugate to that component doesn’t appear in H. For example, translation invariance 
of a potential in a particular direction means that component of the momentum is a 
constant (dp,/dt = —O0H/0dq' = 0), rotational invariance about some axis means that 


component of angular momentum is a constant (dJ/dt = —OH/06 = 0), etc. 


2. Fermions 


As we learned in our quantum mechanics course, two particles of the same type 
are indistinguishable. Furthermore, while an arbitrary number of bosons (particles 
satisfying Bose-Einstein statistics) can each exist in the same one-particle state, only 
one (or zero) fermions can exist in the same one-particle state. (For example, we can 
have a state consisting of 17 photons each of the same momentum and each of the 
same polarization, and we can’t tell which is which, but we can only have 1 electron 
in such a state.) In terms of wave functions, e.g., a 2-particle wave function, made 
from 1-particle wave functions of the form 7;(x) (where x labels the spatial position 


and 7 other properties), we conveniently define 


bosons: Wi (x, x’) = + Wii (2", x) 


fermions: Wy (x, 2’) = —Wy;(x", x) 


For « = wv’ andi = 7’ the signs (which could be phases, but are chosen real for 
convenience) are chosen so ~;;(a,2) vanishes for fermions but not necessarily for 
bosons, so no 2 fermions are in the same state. For other cases the relation avoids 
double counting for the 2 particles being switched; the signs are arbitrary, but are 
chosen consistently with the previous case so that the relation is local. The symmetry 
of wave functions for bosons and antisymmetry for fermions corresponds to operators 
that commute for bosons and anticommute for fermions (or for properties associated 


with fermions). 


As we know experimentally, and we will see follows from relativistic field theory, 
particles with half-integral spins obey Fermi-Dirac statistics. Let’s therefore consider 
the classical limit of fermions: This will prove useful later, when we define quantum 
field theory by quantizing classical field theory. (A similar approach can be taken to 
the quantum mechanics of fermions, but is less useful, which is one reason why non- 
relativistic quantum mechanics of spin 4 is usually done directly, without reference to 
the corresponding classical mechanics.) This will lead to generalizations of the con- 
cepts of brackets and coordinates. Bosons (more generally, bosonic operators) obey 


commutation relations, such as [x, p| = ih; in the classical limit they just commute. 
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Fermions obey anticommutation relations, such as {¢,¢'} = hi for a single fermionic 
harmonic oscillator, where 


{A,B} =AB+BA 


is the anticommutator, expressed in terms of “braces” {, }” instead of the “(square) 
brackets” [,] used for commutators. So, in the truly classical (not semiclassical) 
limit they anticommute, C¢' + ¢'C = 0. Actually, the simplest case is a single real 


(hermitian) fermion: Quantum mechanically, or semiclassically, we have 


h= {€,€} =2¢ 


while classically €* = 0. There is no analog for a single boson: [z,z] = 2? — x? = 0. 
This means that classical fermionic fields must be “anticommuting”: Two such objects 
get a minus sign when pushed past each other. As a result, the product of two 


fermionic quantities is bosonic, while fermionic times bosonic gives fermionic. 


Exercise [A2.1 
Show 


[B,C] =[A,D]=0 = [AB,CD]=4{A,C}[B, D] + 5[A, C]{B, D} 


Functions of anticommuting variables are simpler than functions of commuting 
variables in every way (algebra and calculus) except for keeping track of signs. This 
is because Taylor expansions in anticommuting variables always terminate. For in- 
stance, given a single anticommuting variable ~, we need to be able to Taylor expand 


functions in w, e.g., to find a basis for the states. We then have simply 


f(b) =a + by 


for constants a and b, since wy? = 0. This generalizes in an obvious way to a function 
of many anticommuting variables: For N such variables, we have 2% terms in the 
Taylor expansion, since any term can be either independent or first-order in each 


variable. 


Note that a has the same statistics as f, while b has the opposite; thus func- 
tions of anticommuting variables will include some anticommuting coefficients. In 
general, when Taylor expanding a function of anticommuting variables we must pre- 
serve the statistics: If we Taylor expand a quantity that is defined to be commuting 
(bosonic), then the coefficients of even powers of anticommuting variables will also be 
commuting, while the coefficients of odd powers will be anticommuting (fermionic), 


to maintain the commuting nature of that term (the product of the variables and 
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coefficient). Similarly, when expanding an anticommuting quantity the coefficients of 


even powers will also be anticommuting, while for odd powers it will be commuting. 


To work with wave functions that are functions of anticommuting numbers, we 
must also understand calculus of anticommuting variables. Since the Taylor expansion 
of a function terminates because wy? = 0, as follows from anticommutativity, an 


anticommuting derivative 0/OwW must also satisfy 


a\2 
a) ai 
Ow 
from either anticommutativity or the fact functions of ~ terminate at first order in w. 
We also need a w integral to define the inner product; indefinite integration turns out 


to be enough. The most important property of the integral is integration by parts; 


then, when acting on any function of w, 


[espn 7 [= 55 


where the normalization is fixed for convenience. This also implies a definition of the 


“(anticommuting) Dirac delta function” , 
dp) =p 
which satisfies 
[av 5 wt) = 10) 


for any function f. However, unlike the commuting case, we also have 


Exercise [A2.2 
Prove this is the most general possibility for anticommuting integration by 
considering action of integration and differentiation on the most general func- 
tion of ~ (which has only two terms). 

We can now consider operators that depend on both commuting (¢”) and anti- 


commuting (7) classical variables, 
“= (o", 0") 
Classically they satisfy the “graded” commutation relations (anticommutation if both 


elements are fermionic, commutation otherwise), not to be confused with the Poisson 
bracket, 


classically jo" 21 =0% Cro — Grd” = OU’ —W"o™ = Ya" +y"y" = 0 
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where we use mixed brackets (square and brace), the square one to the left to indicate 
the usual commutator unless both arguments are fermionic. This relation is then 


generalized to the graded quantum mechanical commutator or Poisson bracket by 
(>! BN} — RQMN OM Qpy = 6 
where {2 is constant, hermitian, and “graded antisymmetric” : 
Qun = 90: Qonn) = Mwy = Qnv + Qm = 9 


where |v] is the difference of the two orderings, as above, while (juv’) is the sum. For 


the standard normalization of canonically conjugate pairs of bosons 
gm _ p _— (q', p’) 
and self-conjugate fermions, we choose 


Qu — gh. (22% 98 = SIC, Cow = ( —) 
Because of signs resulting from ordering anticommuting quantities, we define 
derivatives unambiguously by their action from the left: 
O 
OPM 


The general Poisson bracket then can be written as 


Or = oy 


a nu_O 
OpM - OPN 


Since derivatives are normally defined to act from the left, there is a minus sign from 


semiclassically [A,B} =—-A 


pushing the first derivative to the left if A and that particular component of 0/0” 
are both fermionic. 


Exercise [A2.3 


Let’s examine some properties of fermionic oscillators: 


a For a single set of harmonic oscillators we have 
{o.al pe, {e,a) = Jala} =0 
Show that the “number operator” ala has the property 
{a, goe'ay = 


(Hint: Since this system has only 2 states, the easiest way is to check the 
action on those states. ) 
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b Define eigenstates of the annihilation operator (“coherent states”) by 
al) = 6I6) 
where ¢ is anticommuting. Show that this implies 


akg) = xl y= e"]0), eC) = (+6), w7"|C) = |ae), 


(ley =e", = facta e**c)(¢ 
Define wave functions in this space, ¥(¢*) = (¢|W). Taylor expand them in 
¢*, and compare this to the usual two-component representation using |0) 
and a'|0) as a basis. 


c Define the “supertrace” by 


str(A) = f acmag eC" (CLAlg 


Find the relation between any operator in this space and a 2x2 matrix, and 
find the expression for the supertrace in terms of this matrix. 
d For two sets of fermionic oscillators, we define 
{ay,a's}={ag,a'g}=1, other {, }=0 
Show that the new operators 


~ ie inmatia: 
a=41, az =e" a9 


(and their Hermitian conjugates) are equivalent to the original ones except 
that one set of the new oscillators commutes (not anticommutes) with the 
other ([@,@'2] = 0, etc.), even though each set satisfies the same anticom- 
mutation relations with itself ({@,,4',} = 1, etc.). Thus, choice of statistics 
is relevant only for particles in the same state: at most one fermion, but 
unlimited bosons. (This change of oscillator basis is called a “Klein trans- 
formation”. It can be useful for discrete sets of oscillators, but not for those 


labeled by a continuous parameter, because of the discontinuity in the com- 
mutation relations when the two labels are equal.) 

Exercise [A2.4 
Repeat exercise [A2.3 for the bosonic oscillator ({a, at] = 1), where the Hilbert 


space is i as, interchanging commu- 


finite-dimensional, paying attention to sign 
tators with anticommutators where necessary, etc. Show that the analog of 
part c defines the ordinary trace. 
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3. Lie algebra 


Since the same symmetries can be expressed in terms of different kinds of brackets 
for classical and quantum theories, it can be useful to work with just those properties 
that the Poisson bracket and commutator have in common, i.e., those that involve 


only the bracket of two operators, not just their ordinary product: 
[aA + BB,C] =alA,C]+ 6[B,C] for numbers a, ( (distributivity) 


[A, B] = —[B, A] (antisymmetry ) 
[A, [B, C]] + [B, [C, A]] + [C,[A, B]] =0 (Jacobi identity) 


with similar expressions (differing only by signs) for anticommutators or mixed com- 


mutators and anticommutators. 


Exercise IA3.1 
Find the generalizations of the Jacobi identity using also anticommutators, 
corresponding to the cases where 2 or 3 of the objects involved are considered 


as fermionic instead of bosonic. 


These properties also give an abstract definition of a form of multiplication, the 
“Lie bracket”, which defines a “Lie algebra”. (The first property is true of algebras 
in general.) Other Lie brackets include those defined by another, associative, form of 
multiplication, such as matrix multiplication, or operator (infinite matrix) multipli- 
cation as in quantum mechanics: In those cases we can write [A, B] = AB— BA, and 
use the usual properties of multiplication (distributivity and associativity) to derive 
the properties of the Lie bracket. (Another familiar example in physics is the “cross” 
product for three-vectors; however, this can also be expressed in terms of matrix 
multiplication.) The most important use of Lie algebras for physics is for describing 


(continuous) infinitesimal transformations, especially those describing symmetries. 


Exercise [A3.2 
Using only the commutation relations of the generators of the Galilean group 
(exercise IA1.2), check all the Jacobi identities. 


For describing transformations, we can also think of the bracket as a derivative: 
The “Lie derivative” of B with respect to A is defined as 


£L4B =[A, B] 


As a consequence of the properties of the Lie bracket, this derivative satisfies the 


usual properties of a derivative, including the Leibniz (distributive) rule. (In fact, 
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for coordinate transformations the Lie derivative is really a derivative with respect to 


the coordinates. ) 


We can now define finite transformations by exponentiating infinitesimal ones: 
Als (l+icke)A => A’=lim(1+ ieLg) <A = AGA 
In cases where we have [A, B] = AB — BA, we can also write 
cila A = eG Ae-iG 


This follows from replacing G on both sides with a@G and taking the derivative with 
respect to a, to see that both satisfy the same differential equation with the same 
initial condition. We then can recognize this as the way transformations are performed 
in quantum mechanics: A linear transformation that preserves the Hilbert-space inner 
product must be unitary, which means it can be written as the exponential of an 


antihermitian operator. 


Just as infinitesimal transformations define a Lie algebra with elements G, finite 


ones define a “Lie group” with elements 
gre 


(or similarly with Lg). The multiplication law of two group elements follows from 
the fact the product of two exponentials can be expressed in terms of multiple com- 
mutators: 


1 
eAeB — cAtB+alA B+... 


We now have the mathematical properties that define a group, namely: 

(1) a product, so that for two group elements g; and gz, we can define gig2, which is 
another element of the group (closure), 

(2) an identity element, so gl = Ig=4q, 


1 


(3) an inverse, where gg-' = g-'g =I, and 


(4) associativity, gi(9293) = (9192)93- 
In this case the identity is 1 = e°, while the inverse is (e4)~1 = e74. 

Thus two consecutive symmetry transformations will automatically involve Lie 
brackets of the generators of infinitesimal transformations. In particular, performing 
two consecutive infinitesimal transformations, followed by the inverse transformations 


in the same order, gives their bracket: 


ereBeAe-8 = exp(e“Be“)e-? w& el. 
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Since the elements of a Lie algebra form a vector space (we can add them and 


multiply by numbers), it’s useful to define a basis: 


G=a'G; => g=e% 


The parameters a’ then also give a set of coordinates for the Lie group. (Previously 
they were required to be infinitesimal, for infinitesimal transformations; now they are 
finite, but may be periodic, as determined by topological considerations that we will 
mostly ignore.) Now the multiplication rules for both the algebra and the group are 
given by those of the basis: 
Gi, G,] = —tfij*Ge 

for the (“structure”) constants f;;" = —f;;*, which define the algebra/group (but are 
ambiguous up to a change of basis). They satisfy the Jacobi identity 


IGi,G],Gy]=0 => fui fer” = 0 
A familiar example is SO(3) (SU(2)), 3D rotations, where f;;" = €i;, if we use G; = 
3eigkT jh. 
Another useful concept is a “subgroup”: If some subset of the elements of a group 
also form a group, that is called a “subgroup” of the original group. In particular, for 


a Lie group the basis of that subgroup will be a subset of some basis for the original 


group. For example, for the Galilean group J;; generate the rotation subgroup. 


Exercise IA3.3 
Let’s examine the subgroup of the Galilean group describing (spatial) coor- 


dinate transformations — rotations and spatial translations: 


a Show that the infinitesimal transformations are given by 
éx' = ae; +&, Cig = —E je 
where the e’s are constants. 
b Exponentiate to find the finite transformations 
g* = aA; + At 
c Show that A,’ must satisfy 
ANA; Sx = 5:3 


both to preserve the scalar product, and as a consequence of exponentiating. 
(Hint: Use matrix notation, and find the equivalent relation between A and 


A-1,) 


54 I. GLOBAL 


d Show that the last equation implies det A = +1, while exponentiating can 
give only det A = 1 (since +1 can’t change continuously to —1). What is the 
physical interpretation of a transformation with det A = —1? (Hint: Consider 


a simple example.) 


These results can be generalized to include anticommutators: When some of the 
basis elements G'; are fermionic, the corresponding parameters a’ are anticommuting 
numbers, the structure constants are defined by [G;,G;}, etc.. Then G = a'G; is 
bosonic term by term, as is g, so bosons transform into bosons and fermions into 
fermions, but Taylor expansion in the a’s will have both bosonic and fermionic co- 
efficients. (For example, for 6A = €B, if A is bosonic, then so is €B, but if also € is 
fermionic, then B will also be fermionic.) 


“en 
t 


For some purposes it is more convenient to absorb the in the infinitesimal 


transformation into the definition of the generator: 
G--iG => 6A= [G, A] = LoA, g= a, Ga G;| = fis®Ge 


This affects the reality properties of G: In particular, if g is unitary (gg' = J), as 
usually required in quantum mechanics, g = e’* makes G hermitian (G = G"), while 
g = e© makes G antihermitian (G = —G'). In some cases anithermiticity can be 
an advantage: For example, for translations we would then have P; = 0; and for 
rotations Jj; = 2;0,), which is more convenient since we know the 7’s in these (and 
any) coordinate transformations must cancel anyway. On the other hand, the U(1) 
transformations of electrodynamics (on the wave function for a charged particle) are 
just phase transformations g = e’” (where @ is a real number), so clearly we want the 
explicit 2; then the only generator has the representation G; = 1. In general we’ll find 
that for our purposes absorbing the 7’s into the generators is more convenient for just 


spacetime symmetries, while explicit 2’s are more convenient for internal symmetries. 


4. Relativity 


The Hamiltonian approach singles out the time coordinate. In relativistic theories 
time can be treated on equal footing with space, and it is useful to take advantage of 
this fact, so that the full Poincaré invariance is manifest. So, we treat the time ¢ and 
spatial position x’ together as a four-vector (or D-vector in D—1 space and 1 time 


dimension) 
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where m = 0,1,...,3 (or D—1), 1 = 1,2,3. Since the energy FE and three-momentum 
p' are canonically conjugate to them, 

[p’, 23] = —i8", [Et] = +7 
we define the 4-momentum as 
pv =(E,p)=1""Pn, Pm = Tm”; = [p™, 2") = —in™, [Pm 2”] = —48F, 


where we raise and lower indices with the “Minkowski metric”, in an “orthonormal 


basis” , 
0 12 3 
O/-1 0 0 0 
1} 0 1 0 0 
Tmn = = Py = —p° = -E 
2} 0 O 1 0 
3\.0 0 0 1 


in four spacetime dimensions, with obvious generalizations to higher dimensions. 
(Sometimes the metric with signs + — —— is used; we prefer — + ++ because it 
is more convenient for quantum calculations. The numbers of positive and negative 
eigenvalues of an invertible matrix is known as its “signature”.) Therefore, we now 
distinguish upper and lower indices in general: At least for position and momentum, 
the upper-indexed x” and p™ have the usual physical interpretation (so 7, and pm 
have extra signs). This is consistent with our previous nonrelativistic notation, since 
3-vector indices do not change sign upon raising or lowering. 

Of course, we could have done that much nonrelativistically. Relativity is a 
symmetry of kinematics and dynamics: In particular, a free, spinless, relativistic 


particle is completely described by the constraint 
pi +m =0 
where we define the covariant square 
DY =D Pm = PP mn = —(P°)” + (pt)? + (p*)? + (p°)? 


(The square of p on the left should not be confused with the second component of p 
on the right.) Our relativistic symmetry must leave this constraint invariant: Thus 
the metric defines the norm of a vector (and an invariant inner product). Therefore, 
to preserve Lorentz invariance it is important that we contract only an upper index 


with a lower index. For similar reasons, we have 
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so quantum mechanically p,, = —i0m. 


We use (natural/Planck) units c = 1 (where c is the speed of light in a vacuum), 
so length and duration are measured in the same units; c then appears only as a 
parameter for defining nonrelativistic expansions and limits. For example, in astro- 
nomical units, c=1 light year/year. In fact, the speed of light is no longer measured, 
but used to define the meter (since 1986) in terms of the second (itself defined by an 
atomic clock), as the distance light travels in a vacuum in exactly 1/299,792,458th 
of a second. So, using metric system units for c is no different than measuring land 
distance in miles and altitude in feet and writing ds? = dx? + dy? + b?dz*, where 
b=(1/5280)miles/foot is the slope of a line raised up 45°. (As we mentioned in sub- 
section IA1, similar remarks will soon apply to A and the kilogram, h = 1 being 


another natural/Planck unit.) 


Unlike the positive-definite nonrelativistic norm of a 3-vector V", for an arbitrary 


4-vector V™ we can have 


< timelike 
Ve 30 lightlike/null 
> spacelike 


In particular, the 4-momentum is timelike for massive particles (m? > 0) and lightlike 
for massless ones (while “tachyons”, with spacelike momenta and m? < 0, do not exist, 
for reasons that are most clear from quantum field theory). With respect to “proper” 
Lorentz transformations, those that can be obtained continuously from the identity, 
we can further classify timelike and lightlike vectors as “forward” and “backward”, 
since there is no way to continuously “rotate” a vector from forward to backward 
without it being spacelike (“sideways”), so only spacelike vectors can have their time 


component change sign continuously. 


The quantum mechanics will be described later, but the result is that this con- 
straint can be used as the wave equation. The main qualitative distinction from the 


nonrelativistic case in the constraint 
nonrelativistic: —2mE+p* =0 


relativistic: — E?+m?+p?=0 


is that the equation for the energy E = p® is now quadratic, and thus has two 
solutions: 
p? = tw, w = J (pi)? + m2 


Later we’ll see how the second solution is interpreted as an “antiparticle” . 
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The translations and Lorentz transformations make up the Poincaré group, the 
symmetry that defines special relativity. (The Lorentz group in D—1 space and 1 
time dimension is the “orthogonal” group “O(D—1,1)”. The “proper” Lorentz group 
“SO(D—1,1)”, where the “S” is for “special”, transforms the coordinates by a matrix 
whose determinant is 1. The Poincaré group is ISO(D—1,1), where the “I” stands 
for “inhomogeneous”.) For the spinless particle they are generated by coordinate 


transformations G; = (Pu, Ja): 
R= Pa; Job = UaPb] 


(where also a,b = 0,...,3). Then the fact that the physics of the free particle is 


invariant under Poincaré transformations is expressed as 
[Pa le m7) = [Jab, PD” =a m7) = 0 


Writing an arbitrary infinitesimal transformation as a linear combination of the gen- 
erators, we find 


oc” = a%e,” +e", Emn = —Enm 


where the e’s are constants. Note that antisymmetry of €,,,, does not imply antisym- 
metry of €," = Empy’”, because of additional signs. (Similar remarks apply to Joy.) 


Exponentiating to find the finite transformations, we have 
ar = ae so a Ae Ne Aig Ting = mn 


The same Lorentz transformations apply to p™, but the translations do not affect 
it. The condition on A follows from preservation of the Minkowski norm (or inner 
product), but it is equivalent to the antisymmetry of €,,” by exponentiating A = e° 


(compare exercise [A3.3). 


Since dx*p, is invariant under the coordinate transformations defined by the Pois- 
son bracket (the chain rule, since effectively pz ~ O,), it follows that the Poincaré 


invariance of p? is equivalent to the invariance of the line element 
ds? = —da™ da" tin 


which defines the “proper time” s. Spacetime with this indefinite metric is called 
“Minkowski space”, in contrast to the “Euclidean space” with positive definite metric 
used to describe nonrelativistic length measured in just the three spatial dimensions. 


(The signature of the metric is thus the numbers of space and time dimensions.) 
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Exercise [A4.1 
For general variables (q™, pm) and generator G, show from the definition of 


the Poisson bracket that 
OG 
6(dq”"’Pm) = —d (c — Pn) 
Pm 


and that this vanishes for any coordinate transformation. 


For the massive case, we also have 


dx 
ds 


For the massless case ds = 0: Massless particles travel along lightlike lines. However, 


pe =m 


we can define a new parameter 7 such that 


_ dx 
dr 


D 


is well-defined in the massless case. In general, we then have 
s=m7T 


While this fixes 7 = s/m in the massive case, in the massless case it instead restricts 
s = 0. Thus, proper time does not provide a useful parametrization of the world 
line of a classical massless particle, while 7 does: For any piece of such a line, dT is 
given in terms of (any component of) p* and dx*. Later we’ll see how this parameter 
appears in relativistic classical mechanics, and is useful for quantum mechanics and 
field theory. 
Exercise [A4.2 
Starting from the usual Lorentz force law for a massive particle in terms of 
proper time s (which doesn’t apply to m = 0), rewrite it in terms of 7 to find 
a form which can apply to m = 0. 
Exercise IA4.3 
The relation between x and p is closely related to the Poincaré conservation 


laws: 


a Show that 


dP, dS ap 0 > Piad Xp) = 1) 


and use this to prove that conservation of P and J imply the existence of a 


parameter 7 such that p* = dx*/dr. 


b Consider a multiparticle system (but still without spin) where some of the 


particles can interact only when at the same point (i.e., by collision; they 
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act as free particles otherwise). Define P, = )\,p/ and Ja, = >>, LisPh as 
the sum of the individual momenta and angular momenta (where we label 
the particle with “J”). Show that momentum conservation implies angular 


momentum conservation, 
AP. a0: = Pid =) 


where “A” refers to the change from before to after the collision(s). 


Special relativity can also be stated as the fact that the only physically observ- 
able quantities are those that are Poincaré invariant. (Other objects, such as vectors, 
depend on the choice of reference frame.) For example, consider two spinless par- 
ticles that interact by collision, producing two spinless particles (which may differ 
from the originals). Consider just the momenta. (Quantum mechanically, this is a 
complete description.) All invariants can be expressed in terms of the masses and the 


“Mandelstam variables” (not to be confused with time and proper time) 


s=—(pi+p2)?, t=—(pi—ps3)?, w=—(pi—pa)? 


where we have used momentum conservation, which shows that even these three 
quantities are not independent: 
4 
pr=—M;, pitp=pstp, > st+ttu= So mj 
T=1 

(The explicit index now labels the particle, for the process 1+2-+3+4.) The simplest 
reference frame to describe this interaction is the center-of-mass frame (actually the 
center of momentum, where the two 3-momenta cancel). In that Lorentz frame, using 
also rotational invariance, momentum conservation, and the mass-shell conditions, the 


momenta can be written in terms of these invariants as 


Pi = (als + my — m3), Ar2, 0, 0) 

p2 = alals + m3 —m}), —12,0, 0) 

D3 = aala(s + m3 — m7), 34 cos 8, Az34 sin 0, 0) 
n= +=(3(s +mi — mi), —A34 cos 0, —A34 sin 6, 0) 


_ 8? 4+ 2st — (5 mz)s + (m2 — m3) (m3 — m3) 
AN 12A34 


Ny = 4s — (mr + mz)"I[s — (mr — my)" 


The “physical region” of momentum space is then given by s > (m, + mz)? and 
(m3 + ma)”, and |cos 0| < 1. 
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Exercise [A4.4 
Derive the above expressions for the momenta in terms of invariants in the 
center-of-mass frame. 

Exercise [A4.5 
Find the conditions on s,t and u that define the physical region in the case 


where all masses are equal. 


For some purposes it will prove more convenient to use a “lightcone basis” 


+ —-— 2 3 
+/ 0 -1 0 0 
4 —-}|/-l 0O 0 0 = 
PY = (PtP!) > Mmm = 0 0 1 0]’ p? = —2p*p” + (p’)?+(p*)? 
3 0 0 O 1 


and similarly for the “lightcone coordinates” (*,x?,x?). (“Lightcone” is an unfor- 
tunate but common misnomer, having nothing to do with cones in most usages.) In 


this basis the solution to the mass-shell condition p? + m? = 0 can be written as 


: (p')? + m? 
== ae 
Dp 


(where now 7 = 2,3), which more closely resembles the nonrelativistic expression. 


(Note the change on indices + «+ — upon raising and lowering.) A special lightcone 
basis is the “null basis”, 


p= z(Ptp), p= lp’ — ip’), B= Zq(v° + wp”) 


+ - ¢ ft 
+/ 0 -1 0 0 
=> mn = ; _ : : : , p= —2ptp™ + Qp'p' 
e\ 0 OO 1g 
where the square of a vector is linear in each component. (We often use “ ” to 


indicate complex conjugation. ) 
Exercise 1A4.6 
Show that for p? +m? = 0 (m? > 0, p* # 0), the signs of p* and p~ are 
always the same as the sign of the canonical energy p”. 
Exercise [A4.7 
Consider the Poincaré group in | extra space dimension (D space, 1 time) for 


a massless particle. Interpret pt as the mass, and p~ as the energy. 
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a Show that the constraint p? = 0 gives the usual nonrelativistic expression for 


the energy. 
b Show that the subgroup of the Poincaré group generated by all generators that 


commute with p* is the Galilean group (in D—1 space and 1 time dimensions). 
Now nonrelativistic mass conservation is part of momentum conservation, 
and all the Galilean transformations are coordinate transformations. Also, 


positivity of the mass is related to positivity of the energy (see exercise [A4.4). 


There are two standard examples of relativistic effects on geometry. Without loss 
of generality we can consider 2 dimensions, by considering motion in just 1 spatial 
direction. One example is called “Lorentz-Fitzgerald contraction”: Consider a finite- 
sized object moving with constant velocity. In our 2D space, this looks like 2 parallel 


lines, representing the endpoints: 


(In higher dimensions, this represents a one-spatial-dimensional object, like a thin 
ruler, moving in the direction of its length.) If we were in the “rest frame” of this 
object, the lines would be vertical. In that frame, there is a simple physical way to 
measure the length of the object: Send light from a clock sitting at one end to a 
mirror sitting at the other end, and time how long it takes to make the round trip. 
A clock measures something physical, namely the proper time T = [ Vds? along its 
“worldline” (the curve describing its history in spacetime). Since ds? is by definition 


the same in any frame, we can calculate this quantity in our frame. 


t 


In this 2D picture lightlike lines are always slanted at +45°. The 2 lines representing 


the ends of the object are (in this frame) x = vt and x = L + vt. Some simple 
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geometry then gives 


T= = > L=V1-vT/2 
This means that the length L we measure for the object is shorter than the length 
T/2 measured in the object’s rest frame by a factor V1 — v? < 1. Unlike T, the L we 
have defined is not a physical property of the object: It depends on both the object 
and our velocity with respect to it. There is a direct analogy for rotations: We can 
easily define an infinite strip of constant width in terms of 2 parallel lines (the ends), 
where the width is defined by measuring along a line perpendicular to the ends. If 
we instead measure at an arbitrary angle to the ends, we won’t find the width, but 


the width times a factor depending on that angle. 


The most common point of confusion about relativity is that events that are 
simultaneous in one reference frame are not simultaneous in another (unless they are 
at the same place, in which case they are the same event). A frequent example is of 
this sort: You have too much junk in your garage, so your car won’t fit anymore. So 
your spouse/roommate/whatever says, “No problem, just drive it near the speed of 
light, and it will Lorentz contract to fit.” So you try it, but in your frame inside the 
car you find it is the garage that has contracted, so your car fits even worse. The real 
question is, “What happens to the car when it stops?” The answer is, “It depends on 
when the front end stops, and when the back end stops.” You might expect that they 
stop at the same time. That’s probably wrong, but assuming it’s true, we have (at 
least) two possibilities: (1) They stop at the same time as measured in the garage’s 
reference frame. Then the car fits. However, in the car’s frame (its initial fast frame), 
the front end has stopped first, and the back end keeps going until it smashes into 
the front enough to make it fit. (2) They stop at the same time in the car’s frame. In 
the garage’s frame, the back end of the car stops first, and the front end keeps going 


until it smashes out the back of the garage. 


The other standard example is “time dilation”: Consider two clocks. One moves 
with constant velocity, so we choose the frame where it is at rest. The other moves 
at constant speed in this frame, but it starts at the position of the first clock, moves 
away, and then returns. (It is usually convenient to compare two clocks when they 
are at the same point in space, since that makes it unambiguous that one is reading 


the two clocks at the same time.) 
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A simple calculation shows that when the moving clock returns it measures a time 
that is shorter by a factor of V1 —v?. Of course, this also has a Newtonian analog: 
Curves between two given points are longer than straight lines. For relativity, straight 
lines are always the longest timelike curves because of the funny minus sign in the 


metric. 


Exercise IA4.8 
You are standing in the road, and a police car comes toward you, flashing 
its lights at regular intervals. It runs you down and keeps right on going, 
as you watch it continue to flash its lights at you at the same intervals (as 
measured by the clock in the car). Treat this as a two-dimensional problem 
(one space, one time), and approximate the car’s velocity as constant. Draw 
the Minkowski-space picture (including you, the car, and the light rays). If 
the car moves at speed v and flashes its lights at intervals ty (as measured by 
the clock’s car), at what intervals (according to your watch) do you see the 


lights flashing when it is approaching, and at what intervals as it is leaving? 


Special relativity is so fundamental a part of physics that in some areas of physics 
every experiment is more evidence for it, so that the many early experimental tests 


of it are more of historical interest than scientific. 


The Galilean group is a symmetry of particles moving at speeds small compared 
to light, but electromagnetism is symmetric under the Poincaré group (actually the 
conformal group). This caused some confusion historically: Since the two groups have 
only translations and rotations in common, it was assumed that nature was invariant 
under no velocity transformation (neither Galilean nor Lorentz boost). In particular, 
the speed of light itself would seem to depend on the reference frame, since the laws 
of nature would be correct only in a “rest frame”. To explain “at rest with respect 
to what,” physicists invented something that is invariant under rotations and space 
and time translations, but not velocity transformations, and called this “medium” for 


wave propagation the “ether,” probably because they were only semiconscious at the 
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time. (The idea was supposed to be like sound traveling through the air, although 


nobody had ever felt an ethereal wind.) 


Many experiments were performed to test the existence of the ether, or at least to 
show that the wave equation for light was correct only in references frames at rest. So 
as not to keep you in suspense, we first tell you the general result was that the ether 
theory was wrong. On the contrary, one finds that the speed of light in a vacuum is 
measured as c in both of two reference frames that are moving at constant velocity 
with respect to each other. This means that electromagnetism is right and Newtonian 
mechanics is wrong (or at least inaccurate), since Maxwell’s equations are consistent 
with the speed of light being the same in all frames, while Newtonian mechanics is 


not consistent with any speed being the same in all frames. 


The first such experiment was performed by A.A. Michelson and E.W. Morley 
in 1887. They measured the speed of light in various directions at various times of 
year to try to detect the effect of the Earth’s motion around the sun. They detected 
no differences, to an accuracy of 1/6th the Earth’s speed around the sun (~ 10~‘c). 
(The method was interferometry: seeing if a light beam split into perpendicular paths 


of equal length interfered with itself.) 
Another interesting experiment was performed in 1971 by J.C. Hafele and R. 


Keating, who compared synchronized atomic clocks, one at rest with respect to the 
Earth’s surface, one carried by plane (a commercial airliner) west around the world, 
one east. Afterwards the clocks disagreed in a way predicted by the relativistic effect 


of time dilation. 


Probably the most convincing evidence of special relativity comes from experi- 
ments related to atomic, nuclear, and particle physics. In atoms the speed of the 
electrons is of the order of the fine structure constant (1/137) times c, and the 
corresponding effects on atomic energy levels and such is typically of the order of the 
square of that (© 10~*), well within the accuracy of such experiments. In particle 
accelerators (and also cosmic rays), various particles are accelerated to over 99% c, 
so relativistic effects are exaggerated to the point where particles act more like light 
waves than Newtonian particles. In nuclear physics the relativistic relation between 
mass and energy is demonstrated by nuclear decay where, unlike Newtonian mechan- 
ics, the sum of the (rest) masses is not conserved; thus the atomic bomb provides 
a strong proof of special relativity (although it seems like a rather extreme way to 


prove a point). 
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5. Discrete: C, P, T 


By considering only symmetries than can be obtained continuously from the iden- 
tity (Lie groups), we have missed some important symmetries: those that reflect some 
of the coordinates. It’s sufficient to consider a single reflection of a spacelike axis, 
and one of a timelike axis; all other reflections can be obtained by combining these 
with the continuous (“proper, orthochronous”) Lorentz transformations. (Spacelike 
and timelike vectors can’t be Lorentz transformed into each other, and reflection of 
a lightlike axis won’t preserve p? + m?.) Also, the reflection of one spatial axis can 
be combined with a 7 rotation about that axis, resulting in reflection of all three 
spatial coordinates. (Similar generalizations hold for higher dimensions. Note that 
the product of an even number of reflections about different axes is a proper rotation; 
thus, for even numbers of spatial dimensions reflections of all spatial coordinates are 
proper rotations, even though the reflection of a single axis is not.) The reversal of the 
spatial coordinates is called “parity (P)”, while that of the time coordinate is called 
“time reversal” (“T”; actually, for historical reasons, to be explained shortly, this is 
usually labeled “CT”.) These transformations have the same effect on the momen- 
tum, so that the definition of the Poisson bracket is also preserved. These “discrete” 
transformations, unlike the proper ones, are not symmetries of nature (except in cer- 
tain approximations): The only exception is the transformation that reflects all axes 
(CPE). 

While the metric 7, is invariant under all Lorentz transformations (by defini- 


tion), the “Levi-Civita tensor” 


Emnpq totally antisymmetric, Eping = —€13 = 1 

is invariant under only proper Lorentz transformations: It has an odd number of 
space indices and of time indices, so it changes sign under parity or time reversal. 
(More precisely, under P or T the Levi-Civita tensor does not suffer the expected 
sign change, since it’s constant, so there is an “extra” sign compared to the one 
expected for a tensor.) Consequently, we can use it to define “pseudotensors”: Given 
“polar vectors”, whose signs change as position or momentum under improper Lorentz 
transformations, and scalars, which are invariant, we can define “axial vectors” and 


“pseudoscalars” as 
Va = tae’ C°D", a) = Cig BCD" 


which get an extra sign change under such transformations (P or CT, but not CPT). 
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There is another such “discrete” transformation that is defined on phase space, 
but which does not affect spacetime. It changes the sign of all components of the 
momentum, while leaving the spacetime coordinates unchanged. This transformation 


” and is also only an approximate symmetry in 


is called “charge conjugation (C) 
nature. (Quantum mechanically, complex conjugation of the position-space wave 
function changes the sign of the momentum.) Furthermore, it does not preserve the 
Poisson bracket, but changes it by an overall sign. (The misnomer “CT” for time 
reversal follows historically from the fact that the combination of reversing the time 
axis and charge conjugation preserves the sign of the energy.) The physical meaning 
of this transformation is clear from the spacetime-momentum relation of relativistic 
classical mechanics p = m dx/ds: It is proper-time reversal, changing the sign of s. 
The relation to charge follows from “minimal coupling”: The “covariant momentum” 
m dx/ds = p+ qA (for charge q) appears in the constraint (p+ qA)? + m? = 0 in an 


electromagnetic background; p — —p then has the same effect as gq — —q. 


In the previous subsection, we mentioned how negative energies were associated 
with “antiparticles”. Now we can better see the relation in terms of charge conjuga- 
tion. Note that charge conjugation, since it only changes the sign of 7 but does not 
effect the coordinates, does not change the path of the particle, but only how it is 
parametrized. This is also true in terms of momentum, since the velocity is given by 
p'/p°. Thus, the only observable property that is changed is charge; spacetime prop- 
erties (path, velocity, mass; also spin, as we’ll see later) remain the same. Another 
way to say this is that charge conjugation commutes with the Poincaré group. One 
way to identify an antiparticle is that it has all the same kinematical properties (mass, 
spin) as the corresponding particle, but opposite sign for internal quantum numbers 
(like charge). (Another way is pair creation and annihilation: See subsection IIIB5 
below.) 


All these transformations are summarized in the table: 


(The upper-left 3x3 matrix contains the definitions, the rest is implied.) In terms 


of complex wave functions, we see that C' is just complex conjugation (no effect on 
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‘““” in the Fourier 


coordinates, but momentum and energy change sign because of the 
transform). On the other hand, for CT and P there is no complex conjugation, but 
changes in sign of the coordinates that are arguments of the wave functions, and 
also on the corresponding indices — the “orbital” and “spin” parts of these discrete 
transformations. (For example, derivatives 0, have sign changes because x% does, so 
a vector wave function w* must have the same sign changes on its indices for 0,~* to 


transform as a scalar.) The other transformations follow as products of these. 


Exercise IA5.1 
Find the effect of each of these 7 transformations on wave functions that are: 


a scalars, b pseudoscalars, c vectors, d axial vectors. 


However, from the point of view of the “particle” there 7s some kind of kinematic 
change, since the proper time has changed sign: If we think of the mechanics of a 
particle as a one-dimensional theory in 7 space (the worldline), where x(7) (as well 
as any such variables describing spin or internal symmetry) is a wave function or field 
on that space, then t — —7 is T on that one-dimensional space. (The fact we don’t 
get CT can be seen when we add additional variables: For example, if we describe 
internal U(N) symmetry in terms of creation and annihilation operators a™ and aj, 
then C mixes them on both the worldline and spacetime. So, on the worldline we 
have the “pure” worldline geometric symmetry CT times C = T.) Thus, in terms of 
“zeroth quantization” , 


worldline T — spacetime C' 


On the other hand, spacetime P and CT are simply internal symmetries with respect 


to the worldline (as are proper, orthochronous Poincaré transformations). 


Quantum mechanically, there is a good reason for particles of negative energy: 
They appear in complex-conjugate wave functions, since (e~*)* = e+. Since we 
always evaluate expressions of the form (f|7), it is natural for energies of both signs 
to appear. 

In classical field theory, we can identify a particle with its antiparticle by requiring 
the field to be invariant under charge conjugation: For example, for a scalar field 


(spinless particle), we have the reality condition 


o(x) = o*(2) 
or in momentum space, by Fourier transformation, 
$(p) = [o(—p)* 


which implies the particle has charge zero (neutral). 
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Conformal 


Poincaré transformations are the most general coordinate transformations that 


preserve the mass condition p? + m? = 0, but there is a larger group, the “confor- 


mal group”, that preserves this constraint in the massless case. Although conformal 


symmetry is not observed in nature, it is important in all approaches to field theory: 


(1) 


First of all, it is useful in the construction of free theories (see subsections IIB1-4 
below). All massive fields can be described consistently in quantum field theory 
in terms of coupling massless fields. Massless theories are a subset of conformal 
theories, and some conditions on massless theories can be found more easily by 
finding the appropriate subset of those on conformal theories. This is related to 
the fact that the conformal group, unlike the Poincaré group, is “simple”: It has 
no nontrivial subgroup that transforms into itself under the rest of the group (like 


the way translations transform into themselves under Lorentz transformations). 


In interacting theories at the classical level, conformal symmetry is also impor- 
tant in finding and classifying solutions, since at least some parts of the action are 
conformally invariant, so corresponding solutions are related by conformal trans- 
formations (see subsections IIIC5-7). Furthermore, it is often convenient to treat 
arbitrary theories as broken conformal theories, introducing fields with which 
the breaking is associated, and analyze the conformal and conformal-breaking 
fields separately. This is particularly true for the case of gravity (see subsections 
IXA7,B5,C2-3,XA3-4,B5-7). 


Within quantum field theory at the perturbative level, the only physical quantum 
field theories are ones that are conformal at high energies (see subsection VIIIC1). 
The quantum corrections to conformal invariance at high energy are relatively 
simple. 

Beyond perturbation theory, the only quantum theories that are well defined may 
be just the ones whose breaking of conformal invariance at low energy is only 
classical (see subsections VIIC2-3,VHIA5-6). Furthermore, the largest possible 
symmetry of a nontrivial S-matrix is conformal symmetry (or superconformal 


symmetry if we include fermionic generators). 


Self-duality (a generalization of a condition that equates electric and magnetic 
fields) is useful for finding solutions to classical field equations as well as sim- 
plifying perturbation theory, and is closely related to “twistors” (see subsections 


IIB6-7,C5,HIC4-7). In general, self-duality is related to conformal invariance: For 
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example, it can be shown that the free conformal theories in arbitrary even di- 
mensions are just those with (on-mass-shell) field strengths on which self-duality 
can be imposed. (In arbitrary odd dimensions the free conformal theories are just 


the scalar and spinor.) 


Transformations A that satisfy 
[A°(a)pa, p*] = C(x)p* 


for some ¢ also preserve p” = 0, although they don’t leave p? invariant. Equivalently, 


we can look for coordinate transformations that scale 
da” = €(x)dx” 


Exercise IA6.1 
Find the conformal group explicitly in two dimensions, and show it’s infinite 
dimensional (not just the SO(2,2) described below). (Hint: Use lightcone 


coordinates. ) 


This symmetry can be made manifest by starting with a space with one extra 


space and time dimension: 


y=tyyiy) = y=yyPnap = (y*? —2Qyty 


2 — y%y’na» uses the usual D-dimensional Minkowski-space metric mas, 


where (y*) 
and the two additional dimensions have been written in a lightcone basis (not to 
be confused for the similar basis that can be used for the Minkowski metric itself). 
With respect to this metric, the original SO(D—1,1) Lorentz symmetry has been 
enlarged to SO(D,2). This is the conformal group in D dimensions. However, rather 
than also preserving (D+2)-dimensional translation invariance, we instead impose the 
constraint and invariance 
P=) or aay 


This reduces the original space to the “projective” (invariant under the ¢ scaling) 


lightcone (which in this case really is a cone). 
These two conditions can be solved by 


y®=ew*, w= (24,1, 42724) 
Projective invariance then means independence from e (y*), while the lightcone con- 
dition has determined y~. y? = 0 implies y- dy = 0, so the simplest conformal 
invariant is 
dy* = (edw + wde)? = e?dw? = e? dx? 
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where we have used w? = 0 > w-dw =0. This means any SO(D,2) transformation 


on y“ will simply scale dx?, and scale e? in the opposite way: 


2 
dx’? = (5) dx? 
e€ 


in agreement with the previous definition of the conformal group. 


The explicit form of conformal transformations on x* = y*/y* now follows from 


their linear form on y“, using the generators 
GAP = ylArPl, Ura] = 188 


of SO(D,2) in terms of the momentum r4 conjugate to y4. (These are defined the same 
way as the Lorentz generators J” = x'*p'l.) For example, Gt~ just scales x%. (Scale 
transformations are also known as “dilatations”, or just “dilations”.) We can also 
recognize G* as generating translations on x*. The only complicated transformations 
are generated by G~*, known as “conformal boosts” (acceleration transformations). 
Since they commute with each other (like translations), it’s easy to exponentiate to 


find the finite transformations: 
y=ey, Gay a4 


for some constant D-vector v* (where 04 = 0/dy*). Since the conformal boosts act 
as “lowering operators” for scale weight (+ — a — —), only the first three terms in 


the exponential survive: 
Gy =0, GYo=auy Gy Su, => 
yo =y, yayrtoy, yt=yttoytgy = 
ge + sur 


~ L+u-a+ jv22? 


/a 


using 2° = y®/y*, y~/y* = 52”. 
Exercise [A6.2 
Make the change of variables to 77 = y*/yt, e = y*, z = gy”. Express 
ra in terms of the momenta (pq,n, 5s) conjugate to (z*,e, z). Show that the 
conditions y? = y4r4 = r? = 0 become z = en = p? = 0 in terms of the new 
variables. 
Exercise [A6.3 


Find the generator of infinitesimal conformal boosts in terms of x° and pg. 
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We actually have the full O(D,2) symmetry: Besides the continuous symmetries, 
and the discrete ones of SO(D—1,1), we have a second “time” reversal (from our 
second time dimension): 


a 
yt o-y => Wor 


Li? 
gv 


This transformation is called an “inversion” . 


Exercise IA6.4 
Show that a finite conformal boost can be obtained by performing a transla- 


tion sandwiched between two inversions. 


Exercise [A6.5 
The conformal group for Euclidean space (or any spacetime signature) can be 
obtained by the same construction. Consider the special case of D=2 for these 
SO(D+1,1) transformations. (This is a subgroup of the 2D superconformal 
group: See exercise I[A6.1.) Use complex coordinates for the two “physical” 
dimensions: 


c= q(x" + ix”) 


a Show that the inversion is 


b Show that the conformal boost is (using a complex number also for the boost 


vector) 
z 


Toke 

Exercise IA6.6 
Any parity transformation (reflection in a spatial axis) can be obtained from 
any other by a rotation of the spatial coordinates. Similarly, when there 
is more than one time dimension, any time reversal can be obtained from 
another (but time reversal can’t be rotated into parity, since a timelike vector 
can’t be rotated into a spacelike one). Thus, the complete orthogonal group 
O(m,n) can be obtained from those transformations that are continuous from 
the identity by combining them with 1 parity transformation and 1 time 


reversal transformation (for mn/0). 


a For the conformal group, find the rotation (in terms of an angle) that rotates 


between the two time directions, and express its action on 2°. 


b Show that for angle 7 it produces a transformation that is the product of time 


reversal and inversion. 
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c Use this to show that inversion is related to time reversal by finding the 


continuum of conformal transformations that connect them. 
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In the previous section we saw various spacetime groups (Galilean, Poincaré, 
conformal) in terms of how they acted on coordinates. This not only gave them a 
simple physical interpretation, but also allowed a direct relation between classical 
and quantum theories. However, as we know from studying rotations in quantum 
theory in terms of spin, we will often need to study symmetries of quantum theories 


for which the classical analog is not so useful or perhaps even nonexistent. 


We therefore now consider some general results of group theory, mostly for con- 
tinuous groups. We use tensor methods, rather than the slightly more powerful but 
greatly less convenient Cartan-Weyl-Dynkin methods. Much of this section should 
be review, but is included here for completeness; it is not intended as a substitute for 
a group theory course, but as a summary of those results commonly useful in field 


theory. 


1. Matrices 


Matrices are defined by the way they act on some vector space; an nxn matrix 
takes one n-component vector to another. Given some group, and its multiplication 
table (which defines the group completely), there is more than one way to represent 
it by matrices. Any set of matrices we find that has the same multiplication table as 
the group elements is called a “representation” of that group, and the vector space on 
which those matrices act is called the “representation space.” The representation of 
the algebra or group in terms of explicit matrices is given by choosing a basis for the 
vector space. If we include infinite-dimensional representations, then a representation 
of a group is simply a way to write its transformations that is linear: ~’ = Mw is 
linear in w. More generally, we can also have a “realization” of a group, where the 
transformations can be nonlinear. These tend to be more cumbersome, so we usually 
try to make redefinitions of the variables that make the realization linear. A precise 
definition of “manifest symmetry” is that all the realizations used are linear. (One 
possible exception is “affine” or “inhomogeneous” transformations wy’ = My + Mb, 
such as the usual coordinate representation of Poincaré transformations, since these 
transformations are still very simple, because they are really still linear, though not 


homogeneous. ) 


Exercise IB1.1 


Consider a general real affine transformation w’ = My+V on an n-component 
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vector w for arbitrary real n x n matrices M and real n-vectors V. A general 


group element is thus (M,V). 


a Perform 2 such transformations consecutively, and give the resulting “group 
multiplication” rule for (M,V,) “x” (Mo, V2) = (Ms, V3). 


b Find the infinitesimal form of this transformation. Define the n?-+-n generators 


as operators on 7, in terms of w* and 0/Ow". 
c Find the commutation relations of these generators. 
d Compare all the above with (nonrelativistic) rotations and translations. 


Exercise IB1.2 


Let’s consider some properties of matrix inverses: 


a Show (AB)~! = B~'A™! for matrices A and B that have inverses but don’t 


necessarily commute with each other. 


b Show that ‘ - 4.9% 99n2 
—— = — —-—B—+—B—B—-... 
A+B A A°ATA’AVA 
(There may be other assumptions; ignore convergence questions. Hint: Mul- 


tiply both sides by A + B.) 


For convenience, we write matrices with a Hilbert-space-like notation, but unlike 
Hilbert space we don’t necessarily associate bras directly with kets by Hermitian 
conjugation, or even transposition. In general, the two spaces can even be different 
sizes, to describe matrices that are not square; however, for group theory we are 
interested only in matrices that take us from some vector space into itself, so they 
are square. Bras have an inner product with kets, but neither necessarily has a norm 
(inner product with itself): In general, if we start with some vector space, written 
as kets, we can always define the “dual” space, written as bras, by defining such an 
inner product. In our case, we may start with some representation of a group, in 
terms of some vector space, and that will give us directly the dual representation. (If 
the representation is in terms of unitary matrices, we have a Hilbert space, and the 


dual representation is just the complex conjugate.) 


So, we define column vectors |7) with a basis |/), and row vectors (w| with a 
basis (;|, where J = 1,...,.n to describe nxn matrices. The two bases have a relative 


normalization defined so that the inner product gives the usual component sum: 


Ww) = Fob lax Gl) =6F = Old) = xd ld) =, OX) = x7 
These bases then define not only the components of vectors, but also matrices: 


M=(|")M7"(a|, (r|M|?) = My’ 
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where as usual the J on the component (matrix element) M;/ labels the row of the 
matrix M, and J the column. This implies the usual matrix multiplication rules, 


inserting the identity in terms of the basis, 
T=|*)(xl = (MN)17 = (MI) (x?) = MIAN? 
Closely related is the definition of the trace, 
tr M=(,|M|')=M; = tr(MN) =tr(NM) 


(We'll discuss the determinant later.) 


The bra-ket notation is really just matrix notation written in a way to clearly 
distinguish column vectors, row vectors, and matrices. We can, of course, also use 


the usual pictorial notation 


V1 
W=([e], =O ..) 
1 2 7 
1{M) M2 My! 
2) My! M2 My! 
M = : 
1| Mp M? M,! 


This is useful only when listing individual components. 


We can easily translate transformation laws from matrix notation into index no- 
tation just by using a basis for the representation space. We now write g and G to 
refer to either matrix representations of the group and algebra elements, or to the 
abstract elements: i.e., either to a specific representation, or the most general one. 
Again writing g = e’@, 

gl) =V)os', GP) = (Gs 
G=0'G:, 4\b) = iG) =|)io'(Gir'ys > bh = ia (Gi)r'Ys 


The dual space isn’t needed for this purpose. However, for any representation of a 
group, the transpose 
(MT), _ MM, 


of the inverse of those matrices also gives a representation of the group, since 


ng2=93 => (g)* "(92)" * = (gs) * 
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[Gi, Go| = Gs => [-GT, —G}] = -G3 


This is the dual representation, which follows from defining the above inner product 


to be invariant under the group: 
dvix) =0 = byt = —ib’a"(Gi) a 


The complex conjugate of a complex representation is also a representation, since 


n92=93 => 92% = 493" 


[Gi, Go] =G,; => [G1*, Go*] = G;* 


From any given representation, we can thus find three others from taking the dual 


and the conjugate: In matrix and index notation, 


wy =g: vr = orbs 
Vag): Pag tty? 
Wag: wa=gtyd; 
Wage: plage tty’ 


since (g-')", g*, and (g~)' (but not g’, etc.) satisfy the same multiplication algebra 
as g, including ordering. We use up/down and dotted/undotted indices to denote 
the transformation law of each type of index; contracting undotted up indices with 
undotted down indices preserves the transformation law as indicated by the remaining 
indices, and similarly for dotted indices. These four representations are not necessarily 
independent: Imposing relations among them is how the classical groups are defined 


(see subsections IB4-5 below). 


2. Representations 


oP) 


For example, we always have the “adjoint” representation of a Lie group/algebra, 


which is how the algebra acts on its own generators: 
(1) adjoint as operator: G=a'G;,, A='G; => 6A=i[G,A] = Ba'f,;"G, 
= dB! = —iBFal(Gi)k',  (Gi)j" = ifs" 


This gives us two ways to represent the adjoint representation space: as either the 


usual vector space, or in terms of the generators. Thus, we either use the matrix 
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A = ('G; (for arbitrary representation of the matrices G;, or treating G; as just 


abstract generators), or we can write A as a row vector: 
(2) adjoint as vector: (A|=6%(;|) = 6(A|=—i(A|G 
=>  66"(i| = —iB*a (Gy)x' (il 


The adjoint representation also provides a convenient way to define a (symmetric) 


group metric invariant under the group, the “Cartan metric”: 
nig = tra(GiGy) = —fir' fir 


(tr4 refers to the trace taken with respect to the representation A; equivalently, we 
could take the G’s inside the trace to be in the A representation.) For “Abelian” 
groups the structure constants vanish, and thus so does this metric. “Semisimple” 
groups are those where the metric is invertible (no vanishing eigenvalues). A “simple” 
group has no nontrivial subgroup that transforms into itself under the rest of the 
group: Semisimple groups can be written as “products” of simple groups. “Compact” 
groups are those where it is positive definite (all eigenvalues positive); they are also 
those for which the invariant volume of the group space is finite. For simple, compact 


groups it’s convenient to choose a basis where 
Mig = CAdij 


for some constant c4 (the “Dynkin index” for the adjoint representation). For some 


general irreducible representation R of such a group the normalization of the trace is 
CR 
tre(GiG;) = Cron = Nh 
CA 


Now the proportionality constant cr/c, is fixed by the choice of R (only), since we 


have already fixed the normalization of our basis. 


Exercise IB2.1 
What is cr for an Abelian group? (Hint: not just 1.) 


In general, the cyclicity property of the trace implies, for any representation, that 
0= tr (iG, G]) = —ifij"tr(Ge) 
so tr(G;) = 0 for semisimple groups. Similarly, we find 


fisx = fij'me = 7% tra([Gi, G;]Gy) 
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is totally antisymmetric: For semisimple groups, this implies the total antisymmetry 
of the structure constants f;;*, up to factors (which are absent for compact groups in 
a basis where 7;; ~ 6;;). This also means the adjoint representation is its own dual. 
(For example, for the compact group SO(3), we have n;; = —€ix€j~, = 26;;.) Thus, 


we can write A in a third way, as a column vector 
(3) adjoint as dual vector: |A) = |')3; = |")G’nj; =  6|A) =iG|A) 


We can also do this for Abelian groups, by defining an invertible metric unrelated to 
the Cartan metric: This is trivial for Abelian groups, since the generators themselves 


are invariant, and thus so is any metric on them. 


An identity related to the trace one is the normalization of the value kr of the 


“Casimir operator” for any particular representation, 
n!GiG; = krpl 


Its proportionality to the identity follows from the fact that it commutes with each 
generator: 


[n*G5G,, Gi] = -if?*{G;,G.} = 0 


using the antisymmetry of the structure constants. (Thus it takes the same value on 
any component of an irreducible representation, since they are all related by group 


transformations.) By tracing this identity, and contracting the trace identity, 


Cc ee 
<4 = tra(n@GiG;) = kradp 


where dr = trpr(Z) is the dimension of that representation. 


Although quantum mechanics is defined on Hilbert space, which is a kind of com- 
plex vector space, more generally we want to consider real objects, like spacetime 
vectors. This restricts the form of linear transformations: Specifically, if we absorb 
i’s as g = e®, then in such representations G itself must be real. These represen- 
tations are then called “real representations”, while a “complex representation” is 
one whose representation isn’t real in any basis. A complex representation space can 
have a real representation, but a real representation space can’t have a complex rep- 
resentation. In particular, coordinate transformations (of real coordinates) have only 
real representations, which is why absorbing the 2’s into the generators is a useful 
convention there. For semisimple unitary groups, hermiticity of the generators of the 


adjoint representation implies (using total antisymmetry of the structure constants 
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and reality of the Cartan metric) that the structure constants are real, and thus the 


adjoint representation is a real representation. More generally, any real unitary rep- 


resentation will have antisymmetric generators (G = G* = —Gt > G = —G"). If the 
complex conjugate representation is the same as the original (same matrices up to a 
similarity transformation g* = MgM~*), but the representation is not real, then it 
is called “pseudoreal”. (An example is the spinor of SU(2), to be described in section 


IC.) 


For any representation g of the group, a transformation g — goggo sae oy every 
group element g for some particular group element go clearly maps the algebra to 
itself, and preserves the multiplication rules. (Similar remarks apply to applying the 
transformation to the generators.) However, the same is true for complex conjugation, 
g — g*: Not only are the multiplication rules preserved, but for any element g 


* is also an element. (This can be shown, 


of that representation of the group, g 
e.g., by defining representations in terms of the values of all the Casimir operators, 
contructed from various powers of the generators.) In quantum mechanics (where 
the representations are unitary), the latter is called an “antiunitary transformation”. 
Although this is a symmetry of the group, it cannot be reproduced by a unitary 


transformation, except when the representation is (pseudo)real. 


Exercise IB2.2 
Show how this works for the Abelian group U(1). Explain this antiunitary 
transformation in terms of two-dimensional rotations O(2). (U(1)=SO(2), 


the “proper rotations” obtained continuously from the identity.) 


A very simple way to build a representation from others is by “direct sum”. If we 
have two representations of a group, on two different spaces, then we can take their 
direct sum by just putting one column vector on top of the other, creating a bigger 
vector whose size (“dimension”) is the sum of that of the original two. Explicitly, if 
we start with the basis |’) for the first representation and |’) for the second, then 
the union (|*), |”)) is the basis for the direct sum. (We can also write |’) = (|*), |“), 
where . = l,...,m; ev =1,...,n; J =1,..,m,m+1,...,.m +n.) The group then acts 


on each part of the new vector in the obvious way: 


/ 


w = |“) ob., X= heya g\’) = Ge’, al) = a ee 
= P= hel w=Wel) or W= (‘) 


K L kK! if Oe 0 
gl) = |"on'th, I" oe” xu or wo =( , 


80 I. GLOBAL 


(We can replace the © with an ordinary + if we understand the basis vectors to be 
now in a bigger space, where the elements of the first basis have zeros for the new 
components on the bottom while those of the second have zeros for the new compo- 
nents on top.) The important point is that no group element mixes the two spaces: 
The group representation is block diagonal. Any representation that can be written 
as a direct sum (after an appropriate choice of basis) is called “reducible”. For exam- 
ple, we can build a reducible real representation from an irreducible complex one by 
just taking the direct sum of this complex representation with the complex conjugate 
representation. Similarly, we can take direct sums of more than two representations. 

A more useful way to build representations is by “direct product”. The idea there 
is to take a colummn vector and a row vector and use them to construct a matrix, 
where the group element acts simultaneously on rows according to one representation 
and columns according to the other. If the two original bases are again |) and |"), 


the new basis can also be written as |’) = |“) (I = 1,..., mn). Explicitly, 
=) @P bu, GOP) =I) OP gn ge => gur™™ = gu" 


or in terms of the algebra 


/ 


/ / if 
Gu” = Gib" = Oey 


A familar example from quantum mechanics is rotations (or Lorentz transformations), 
where the first space is position space (so v is the continuous index x), acted on by 
the orbital part of the generators, while the second space is finite-dimensional, and is 
acted on by the spin part of the generators. Direct product representations are usually 
reducible: They then can be written also as direct sums, in a way that depends on 


the particulars of the group and the representations. 
Consider a representation constructed by direct product: In matrix notation 
Gi=G el +1@aG, 
Using tr(A ® B) = tr(A)tr(B), and assuming tr(G;) = tr(G) = 0, we have 
tr(G,G;) = tr(I’)tr(GG,;) + tr(I)tr(G5G%) 
For example, for SU(N) (see subsection [B4 below) we can construct the adjoint rep- 
resentation from the direct product of the N-dimensional, “defining” representation 


and its complex conjugate. (We also get a singlet, but it will not affect the result for 
the adjoint.) In that case we find 


tra(GiG;) = 2N trp(GiG;) > es _ oN 


For most purposes, we use trp(G;G,;) = 6;; (cp = 1) for SU(N), so cg = 2N. 
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3. Determinants 


We now “review” some properties of determinants that will prove useful for the 
group analysis of the following subsections. Determinants can be defined in terms of 


the Levi-Civita tensor «. As a consequence of its antisymmetry, 
€ totally antisymmetric,  €12..n = 7] = ge = os, oe OF) 


since each possible numerical index value appears once in each €, so they can be 


matched up with 6’s. By similar reasoning, 


Kiva Koad vlna = oy oo .§in-m 
Jt 


1 
mi CK1...KmJ1-.JIn—mE pees 


where the normalization compensates for the number of terms in the summation. 


Exercise IB3.1 


Apply these identities to rotations in three dimensions: 

a Given only the commutation relations [J;;, J™] = id Ja! and the definition 
Gi = d6ijnJ jr, derive fj” = eijx- 

b Show the Jacobi identity €fij'€xl™ = 0 by explicit evaluation. 

c Find the Cartan metric, and thus the value of cy. 


This tensor is used to define the determinant: 
J_ 1 Theln J Tes J Tos 
det M, = SH sci = My, te.) My => €J,...J, M1, te... My = €7,...1,det M 


since anything totally antisymmetric in n indices must be proportional to the € tensor. 


This yields an explicit expression for the inverse: 


(M~*);," a maT inne Mi,” + - M7" (det M)"' 


From this follows a useful expression for the variation of the determinant: 


0 (apy 
ET Pe ie \y det M 


which is equivalent to 
6 In det M =tr(M~'dM) 


Replacing M with e™ gives the often-used identity 


6 In det eM =tr(e™@5de“) =tr 6M = det eM=e™™ 
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where we have used the boundary condition for M = 0. Finally, replacing M in 
the last identity with /n(1 + L) and expanding both sides to order L” gives general 


expressions for determinants of n x n matrices in terms of traces: 


det(1+L)=e™™O) = det L= (tr L)” - lr 7 \Ge LY? 44% 


1 
2(n—2)! 


Exercise IB3.2 
Use the definition of the determinant (and not its relation to the trace) to 
show 
det(AB) = det(A)det(B) 


These identities can also be derived by defining the determinant in terms of a 
Gaussian integral. We first collect some general properties of (indefinite) Gaussian 


integrals. The simplest such integral is 


2 27 Q love) 
Ge ee /2 — i : =f drre /? = -| due “=1 
T 


= oe x? /2 dx —x?/2 = dx —2?/2 Dye 1 
(27) = J20 Qn 


The complex form of this integral is 


— d?z —|z|? 
—~—e =] 
(254)? 


by reducing to real parameters as z = (x + iy)/\/2. These generalize to integrals 


involving a real, symmetric matrix S or a Hermitian matrix H as 


dex Ts d? * dz ; 
—a* Sax/2 __ t a —2'Hz _ tH -1 
lam Mere / (2Q7ri)? ee 


by diagonalizing the matrices, making appropriate redefinitions of the integration 
variables, and identifying the determinant of a diagonal matrix. Alternatively, we 
can use these integrals to define the determinant, and derive the previous definition. 
The relation for the symmetric matrix follows from that for the Hermitian one by 
separating z into its real and imaginary parts for the special case H = S. If we treat 
z and z* as independent variables, the determinant can also be understood as the 
Jacobian for the (dummy) variable change z + H~'z, z* — z*. More generally, if 
we define the integral by an appropriate limiting procedure or analytic continuation 
(for convergence), we can choose z and z* to be unrelated (or even separate real 


variables), and S and H to be complex. 
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Exercise IB3.3 
Other properties of determinants can also be derived directly from the integral 


definition: 


a Find an integral expression for the inverse of a (complex) matrix M by using 


the identity P 
_— —ztMz 
= ; Ozr (2 ) 


b Derive the identity 6 In det M = tr(M~!6M) by varying the Gaussian defi- 


nition of the (complex) determinant with respect to M. 


An even better definition of the determinant is in terms of an anticommuting 
integral (see subsection IA2), since anticommutativity automatically gives the anti- 
symmetry of the Levi-Civita tensor, and we don’t have to worry about convergence. 


We then have, for any matrix M, 
fart ac eM = det M 


where ¢T can be chosen as the Hermitian conjugate of ¢ or as an independent variable, 
whichever is convenient. From the definition of anticommuting integration, the only 
terms in the Taylor expansion of the exponential that contribute are those with the 
product of one of each anticommuting variable. Total antisymmetry in ¢ and in CT 
then yields the determinant; we define “d?¢t d?¢” to give the correct normalization. 
(The normalization is ambiguous anyway because of the signs in ordering the d¢’s.) 
This determinant can also be considered a Jacobian, but the inverse of the commuting 


result follows from the fact that the integrals are now really derivatives. 


Exercise IB3.4 


Divide up the range of a square matrix into two (not necessarily equal) parts: 


v=(6 5) 


and do the same for the (commuting or anticommuting) variables used in 


In block form, 


defining its determinant. Show that 


A B 
det (< s = det D- det(A — BD~'C) = det A+ det(D — CA™'B) 


a by integrating over one part of the variables first (this requires off-diagonal 


changes of variables of the form y — y + Ox, which have unit Jacobian), or 


b_ by first proving the identity 


A B\ (I BD"\ (A-BD"'C 0 IO 
ps 4G 7 0 D o> 
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We then have, for any antisymmetric (even-dimensional) matrix A, 
/ APE eS AS — PE A, (Pf A)? = det A 


by the same method as the commuting case (again with appropriate definition of the 
normalization of d?”€; the determinant of an odd-dimensional antisymmetric matrix 
vanishes, since det M = det M7). However, there is now an important difference: The 
“Pfaffian” is not merely the square root of the determinant, but itself a polynomial, 


since we can evaluate it also by Taylor expansion: 
— 1 Iy...lap 
Pf AT = Digp € ATE? Ai ibs 


which can be used as an alternate definition. (Normalization can be checked by 


examining a special case; the overall sign is part of the normalization convention.) 


4. Classical groups 


The rotation group in three dimensions can be expressed most simply in terms 
of 2x2 matrices. This description is the most convenient for not only spin 1/2, but 
all spins. This result can be extended to orthogonal groups (such as the rotation, 
Lorentz, and conformal groups) in other low dimensions, including all those relevant 


to spacetime symmetries in four dimensions. 


There are an infinite number of Lie groups. Of the compact ones, all but a finite 
number are among the “classical” Lie groups. These classical groups can be defined 
easily in terms of (real or complex) matrices satisfying a few simple constraints. (The 
remaining “exceptional” compact groups can be defined in a similar way with a little 
extra effort, but they are of rather specialized interest, so we won’t cover them here.) 
These matrices are thus called the “defining” representation of the group. (Sometimes 
this representation is also called the “fundamental” representation; however, this term 
has been used in slightly different ways in the literature, so we will avoid it.) These 


constraints are a subset of: 


volume: Special: det(g) =1 
hermitian: Unitary: g'Yg =P (Mea) 
metric: ieee { ees ging =n (nt =n) 
Symplectic: g' 2g =22 (QF = -2) 
reality: { apie g° = ng 
pseudoreal (*): g*¥ = QgQu! 
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where g is any matrix in the defining representation of the group, while Y,7, 22 are 
group “metrics”, defining inner products (while the determinant defines the volume, 
as in the Jacobian). For the compact cases Y and 7 can be chosen to be the identity, 
but we will also consider some noncompact cases. (There are also some uninteresting 
variations of “Special” for complex matrices, setting the determinant to be real or its 
magnitude to be 1.) 
Exercise IB4.1 
Write all the defining constraints of the classical groups (S, U, O, Sp, R, 
pseudoreal) in terms of the algebra rather than the group. 
Note the modified definition of unitarity, etc. Such things are also encountered 
in quantum mechanics with ghosts, since the resulting Hilbert space can have an 
indefinite metric. For example, if we have a finite-dimensional Hilbert space where 


the inner product is represented in terms of matrices as 
(Ix) = ¥'Tx 
then “observables” satisfy a “pseudohermiticity” condition 
(v/Hx) =(H¥\x) => YH=H'Y 
and unitarity generalizes to 
(UyUx) = Wx) => U'ru=Y7 


Similar remarks apply when replacing the Hilbert-space “sesquilinear” (vector times 
complex conjugate of vector) inner product with a symmetric (orthogonal) or anti- 
symmetric (symplectic) bilinear inner product. An important example is when the 
wave function carries a Lorentz vector index, as expected for a relativistic description 


of spin 1; then clearly the time component is unphysical. 


The groups of matrices that can be constructed from these conditions are then: 


GLa.0) SLO] UsUan) oT Re [ * | 
ae [0 [[slo@.n) 
R: GL(n) [SL(n)] Sp [| Sp(2n) | USp(2n,,2n-) 


*: [SJU*(2n) 


Of the non-determinant constraints, in the first column we applied none (“GL” means 


“general linear”, and “C” refers to the complex numbers; the real numbers “R” are 
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implicit); in the second column we applied one; in the third column we applied three, 
since two of the three types (unitarity, symmetry, reality) imply the third. (The 
corresponding groups with unit determinant, when distinct, are given in brackets.) 
These square matrices are of size n, ny+n_, 2n, or 2n,+2n_, as indicated. n, and 
n_ refer to the number of positive and negative eigenvalues of the metric Y or 7. 
O(n) differs from SO(n) by including “parity”-type transformations, which can’t be 
obtained continuously from the identity. (SSp(2n) is the same as Sp(2n).) For this 
reason, and also for studying “topological” properties, for finite transformations it 
is sometimes more useful to work directly with the group elements g, rather than 
parametrizing them in terms of algebra elements as g = e’°. U(n) differs from SU(n) 
(and similarly for GL(n) vs. SL(n)) only by including a U(1) group that commutes 
with the SU(n): Although U(1) is noncompact (it consists of just phase transforma- 
tions), a compact form of it can be used by requiring that all “charges” are integers 
(i.e., all representations transform as ~’ = e’%° for group parameter 6, where q is an 


integer defining the representation). 


Of these groups, the compact ones are just SU(n), SO(n) (and O(n)), and USp(2n) 
(all with n_=0). The compact groups have an interesting interpretation in terms of 
various number systems: SO(n) is the unitary group of nxn matrices over the real 
numbers, SU(n) is the same for the complex numbers, and USp(2n) is the same for 
the quaternions. (Similar interpretations can be made for some of the noncompact 
groups.) The remaining compact Lie groups that we didn’t discuss, the “exceptional” 
groups, can be interpreted as unitary groups over the octonions. (Unlike the classical 
groups, which form infinite series, there are only five exceptional compact groups, 


because of the restrictions following from the nonassociativity of octonions. ) 


5. Tensor notation 


Usually nonrelativistic physics is written in matrix or Gibbs’ notation. This is 
insufficient even for 19th century physics: We can write a column or row vector p 
for momentum, and a matrix T’ for moment of inertia, but how do we write in that 
notation more general objects? These are different representations of the rotation 


group: We can write how each transforms under rotations: 
p = pA, T =A'TA 


The problem is to write all representations. 


One alternative is used frequently in quantum mechanics: A scalar is “spin 0”, a 


vector is “spin 1”, etc. Spin s has 2s+1 components, so we can write a column “vector” 
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with that many components. For example, moment of inertia is a symmetric 3x3 
matrix, and so has 6 components. It can be separated into its trace S and traceless 


pieces R, which don’t mix under rotations: 
T=R+55I, tr(T)=S, ir(R)=0 


= ir(T’)=tr(A'TA) =itr(AA'T)=tr(T) = ir(R')=0, S'=S8 


using the cyclicity of the trace. Thus the “irreducible” parts of T are the scalar S 
and the spin-2 (5 components) R. But if we were to write R as a 5-vector, it would 
be a mess to relate the 5x5 matrix that rotates it to the 3x3 matrix A, and even 
worse to write a scalar like pRp’ in terms of 2 3-vectors and 1 5-vector. (In quantum 


mechanics, this is done with “Clebsch-Gordan-Wigner coefficients” .) 


The simplest solution is to use indices. Then it’s easy to write an object of 
arbitrary integer spin s as a generalization of what we just did for spins 0,1,2: It 
has s 3-vector indices, in which it is totally (for any 2 of its indices) symmetric and 
traceless: 

Tires. Ported 2 Pobetn Pot dn§, = 0 


and it transforms as the product of vectors: 
Wests. =. Paw ds L Ls 
De Se A aa 


Similar remarks apply to group theory in general: Although historically group 
representations have usually been taught in the notation where an m-component rep- 
resentation of a group defined by nxn matrices is represented by an m-component 
vector, carrying a single index with values 1 to m, a much more convenient and trans- 
parent method is “tensor notation”, where a general representation carries many 
indices ranging from 1 to n, with certain symmetries (and perhaps tracelessness) im- 
posed on them. (Tensor notation for a covering group is generally known as “spinor 
notation” for the corresponding orthogonal group: See subsection IC5.) This notation 
takes advantage of the property described above for expressing arbitrary representa- 
tions in terms of direct products of vectors. In terms of transformation laws, it means 
we need to know only the defining representation, since the transformation of this 


representation is applied to each index. 


There are at most four vector representations, by taking the dual and complex 
conjugate; we use the corresponding index notation. Then the group constraints 
simply state the invariance of the group metrics (and their complex conjugates and 


inverses), which thus can be used to raise, lower, and contract indices: 
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volume: Special: — €/t--/n 
hermitian: Unitary: ru 
metric: ; ; Orthogonal: ye 
(anti)symmetric: I 
Symplectic: 2 
Real: ny 
reality: 
. { pseudoreal (*): ay 


As a result, we have relations such as 
(y= nl or Ql, (J) yi 
We also define inverse metrics satisfying 
iSO” Oe = aa oe =; 


(and similarly for contracting the second index of each pair). Therefore, with uni- 
tarity/(pseudo)reality we can ignore complex conjugate representations (and dotted 
indices), converting them into unconjugated ones with the metric, while for orthogo- 
nality/symplecticity we can do the same with respect to raising /lowering indices: 

Unitary: yl = Tip, 

Orthogonal: a! = 74 

Symplectic: ww! = Qu, 

Real: W; = ny wy 

pseudoreal (*): Wy = Q,7v7 


For the real groups there is also the constraint of reality on the defining representation: 
oy = (w1)* = vy Sy vs 


Exercise IB5.1 
As an example of the advantages of index notation, show that SSp is the 
same as Sp. (Hint: Write one ¢€ in the definition of the determinant in terms 
of 2’s by total antisymmetrization, which then can be dropped because it 
is enforced by the other «. One can ignore normalization by just showing 
det M = det I.) 


For SO(n,,n_), there is a slight modification of a sign convention: Since then 


Liss 


indices can be raised and lowered with the metric, €’ is usually defined to be the 


result of raising indices on €;.., which means 


fa.n=1l => 6%" = det n= (-1)* 
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Then ¢/: should be replaced with (—1)"-e!" in the equations of subsection IB3: For 
example, 


Stone = (“185  O 


We now give the simplest explicit forms for the defining representations of the 
classical groups. The most convenient notation is to label the generators by a pair 
of fundamental indices, since the adjoint representation is obtained from the direct 
product of the fundamental representation and its dual (i.e., as a matrix labeled by 
row and column). The simplest example is GL(n), since the generators are arbitrary 
matrices. We therefore choose as a basis matrices with a 1 as one entry and 0’s 
everywhere else, and label that generator by the row and column where the 1 appears. 
Explicitly, 

GL(n): (Gr )xk" =b75% => Gr =|") 


This basis applies for GL(n,C) as well, the only difference being that the coefficients 
ain G=a,/G,' are complex instead of real. The next simplest case is U(n): We can 
again use this basis, although the matrices G;’ are not all hermitian, by requiring 
that a,’ be a hermitian matrix. This turns out to be more convenient in practice 
than using a hermitian basis for the generators. A well known example is SU(2), 
where the two generators with the 1 as an off-diagonal element (and 0’s elsewhere) 


are known as the “raising and lowering operators” J, and are more convenient than 


their hermitian parts for purposes of contructing representations. (This generalizes 
to other unitary groups, where all the generators on one side of the diagonal are 
raising, all those on the other side are lowering, and those along the diagonal give the 
maximal Abelian subalgebra, or “Cartan subalgebra” .) 


Representations for the other classical groups follow from applying their defini- 
tions to the GL(n) basis. We thus find 


SL(n):  (Gr’)«" = 575% — 5579K = Gr = |") — 357 /* XI 
SO(n): (Gri)*" = 6F6_ > Grr = |) (a 
Sp(n):  (Grx)*" = 566 > Grr =a) (aI 
As before, SL(n,C) and SU(n) use the same basis as SL(n), etc. For SO(n) and Sp(n) 
we have raised and lowered indices with the appropriate metric (so SO(n) includes 


SO(n,,n_)). For some purposes (especially for SL(n)), it’s more convenient to impose 


tracelessness or (anti)symmetry on the matrix a, and use the simpler GL(n) basis. 


Exercise IB5.2 
Our normalization for the generators of the classical groups is the simplest, 


and independent of n (except for subtracting out traces): 
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a Find the commutation relations of the generators (structure constants) for the 
defining representation of GL(n) as given in the text. Note that the values of 


all the structure constants are 0, +2. Show that 
Cp = i. 


(see subsection IB2). 


b Consider the GL(m) subgroup of GL(n) (m<n) found by restricting the range 
of the index of the above defining representation. Show the structure con- 
stants are the same as those given by starting with the above representation 
of GL(m). 

c Find the structure constants for SO(n) and Sp(n). 


d Directly evaluate kpc, (= 6%G,;G;) for SL(n), SO(n), and Sp(n), and compare 
with cpda/dp. 


Exercise IB5.3 


A tensor that pops up in various contexts is 
ier = ir(GiiGs, G;}) 


It takes a very simple form in terms of defining indices: 


a Show that for SU(n) this tensor is determined to be, up to an overall normal- 

ization (that depends on the representation), 
2 4 
flGA™ (G57, Gy hh [(231) + (312)] — —[(132) + (213) + (321)] +5 (123) 
(abc) = Caran 

(where a,b,c are some permutation of 1,2,3) from just the total symmetry 
of dijx (and G;' = 0), since the only invariant tensor available is 67. (If ery... 
were used, ¢/% would also be required, to balance the number of subscripts 


and superscripts; but their product can be expressed in terms of just 6’s also.) 


b Check this result by using the explicit G’s for the defining representation, and 


determine the proportionality constant for that representation. 


With the exception of the “spinor” representations of SO(n) (to be discussed 
in subsection IC5, section IA, and subsection XC1), general representations can be 
obtained by reducing direct products of the defining representations. This means they 
can be described by objects with multiple indices (up/down, dotted/undotted), where 
each index is that of a defining representation, and satisfying various (anti)symmetry 


and tracelessness conditions on the indices. 
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Exercise IB5.4 
Consider the representations of SU(n) obtained from the symmetric and an- 
tisymmetric part of the direct product of two defining representations. For 
simplicity, one can work with the U(n) generators, since the U(1) pieces will 


appear in a simple way. 


a Using tensor notation for the generators (G77)x,™“%, find their explicit rep- 


resentation for these two representations. 


b By evaluating the trace, show that the Dynkin index for the two cases is 
oe C5 =n+2 


c Show the sum of these two is consistent with the argument at the end of 
subsection IB2. Show each case is consistent with n=2, and the antisymmetric 
case with n=3, by relating those cases to the singlet, defining, and adjoint 


representations. 
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Siergietcaesaedscas C. REPRESENTATIONS .......00.0........ 


We now consider some of the more useful representations, as explicit examples 
of the results of the previous section. In particular, we consider symmetries of the 


quark model. 


1. More coordinates 


We began our “review” of group theory by looking at how symmetries were rep- 
resented on coordinates. We now return to coordinates as a special case (particular 
representation) of the general results of the previous section. The idea is that the 
coordinates themselves are already a representation of the group, and the wave func- 
tions are functions of these coordinates. For example, for ordinary rotations we use 
wave functions that depend on position or momentum, which transforms as a vec- 
tor. (This is not always the case: For example, in our description of the conformal 
group the usual space and time coordinates transformed nonlinearly, and not just 
by multiplication by constant matrices unless the extra two coordinates were intro- 
duced.) This is the basic distinction between classical mechanics and classical field 
theory: Mechanics uses the coordinates themselves as the basic variables, while field 
theory uses functions of the coordinates. (Similarly, in quantum mechanics the wave 
functions are functions of the coordinates, while in quantum field theory the wave 


functions are “functionals” of functions of the coordinates.) 


In general, the construction of such a “coordinate representation” starts with a 
given matrix representation (usually finite dimensional) (G;);7 and then defines a 


new representation 
Gi=q'(Gi)rps; — [prna’}=67, [a,a}=[p,p} =0 


for some objects g and p, which are interpreted as either coordinates and their con- 
jugate momenta (up to a factor of 7), or as creation and annihilation operators: The 
latter nomenclature is used when the boundary conditions allow the existence of a 
state |0) called the “vacuum”, satisfying p|0) = 0, so we can define the other states 
as functions of qg acting on |0). (If the coordinates are fermionic, the distinction is 
moot, since by the usual Taylor expansion the Hilbert space is finite dimensional. See 
exercise [A2.3.) It is easy to check that en satisfy the same commutation relations 
as G;. In particular, if the matrices are in the adjoint representation, g’ can be inter- 
preted as the group coordinates themselves: This follows from considering the action 
of an infinitesimal transformation on the group element g(q) =e’ (or just the Lie 
algebra element G(q) = q'G;). 
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If we write these results in bra/ket notation, since 
Gig! = a (Gar, Gip = —(Gi)r"ps 
it is more natural to look at the action on bras: 
l=a'Cl, |p) = Yer => Gilgl = (alGi,  Gilp) = —Gilp) 


Note that this vector space is coordinate space itself, not the space of functions of 
the coordinates; it is the same space on which G; is defined. (Of course, G; is defined 
on arbitrary functions of the coordinates; it has a reducible representation bigger 
than (G;);7. Effectively, (G;);7 is represented on the space of functions linear in the 


coordinates.) Then, for example 
G1Go(q| a Gi(q|Go = (q|GiGe 
is obviously equivalent, while (ignoring any extra signs for fermions) 
GG2|p) = —GG2|p) = —G2Gi|p) = G2G|p) 

at least gives an equivalent result for the commutator algebra Gi, Gol. This is the 
expected result for the dual representation G; — —G?. 

Interesting examples are given by using the defining representation for G. For 
example, the commonly used oscillator representation for U(n) is 

U(n): Gy’ =alay, (az, a'?} = 67 


where the oscillators can be bosonic or fermionic. For the SO and Sp cases, because 
we can raise and lower indices, and because of the (anti)symmetry on the indices, the 
interesting possibility arises to identify the coordinates with their momenta, with the 


statistics appropriate to the symmetry: 
Sp(n): Gry =4euzy, [21,25] = 21 


SO(n) : Gry = sua» (0 Ya} = 0s 
For SO(n) the representation is finite dimensional because of the Fermi-Dirac statis- 
tics, and is called a “Dirac spinor” (and y the “Dirac matrices”). If the opposite 
statistics are chosen, the coordinates and momenta can’t be identified: For example, 
bosonic coordinates for SO(n) give the usual spatial rotation generators Gry = Lr 7)- 
Exercise IC1.1 
Use this bosonic oscillator representation for U(2)=SU(2)@U(1), and use the 
SU(2) subgroup to describe spin. 
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a Show that the spin s (the integer or half-integer number that defines the rep- 
resentation) itself has a very simple expression in terms of the U(1) generator. 
Show this holds in the quantum mechanical case (by interpreting the bracket 
as the quantum commutator), giving the usual s(s + 1) for the sum of the 


squares of the generators (with appropriate normalization). 


b Use this result to show that these oscillators, acting on the vacuum state, can 


be used to construct the usual states of arbitrary spin s. 


Exercise IC1.2 
Considering SO(2n), divide up y; into pairs of canonical (and complex) con- 
jugates a, = (71 + ty2)/V2, etc., so {a,at} = 1. 
a Write the SO(2n) generators in terms of aa, alat, and ata. Show that the 


a'a’s by themselves generate a U(n) subgroup. 


b Decompose the Dirac spinor into U(n) representations. Show that the product 
of all the y’s is related to the U(1) generator, and commutes with all the 
SO(2n) generators. Show that the states created by even or odd numbers of 
a'’s on the vacuum don’t mix with each other under SO(2n), so the Dirac 


spinor is reducible into two “Weyl spinors”. 


2. Coordinate tensors 


We have just seen how groups can be represented on coordinates. Depending 
on the choice of coordinates, the coordinates may transform nonlinearly (i.e., as a 
realization, not a representation), as for the D-dimensional conformal group in terms 
of D (not D+2) coordinates. However, given the nonlinear transformation of the 
coordinates, there are always representations other than the defining one (scalar field) 
that we can immediately write down (such as the adjoint). We now consider such 
representations: These are useful not only for the spacetime symmetries we have 
already considered, but also for general relativity, where the symmetry group consists 
of arbitrary coordinate transformations. Furthermore, these considerations are useful 
for describing coordinate transformations that are not symmetries, such as the change 
from Cartesian to polar coordinates in nonrelativistic theories. 

When applied to quantum mechanics, we write the action of a symmetry on a 
state as dW = iGw (or y = e’%y), but on an operator as 6A = i[G, A] (or A’ = 
e'?Ae~'®). In classical mechanics, we always write 6A = i[G, A] (since classical 


objects are identified with quantum operators, not states). However, if G = X'"0,, is 
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a coordinate transformation (e.g., a rotation) and ¢ is a scalar field, then in quantum 


notation we can write 


56(x) = [G,d]=Ge=A"In@ ($= e® Ge = €%8) 


since the derivatives in G just differentiate ¢. (For this discussion of coordinate 
transformations we switch to absorbing the 7’s into the generators.) The coordinate 


transformation G has the usual properties of a derivative: 
IG, f@l=Gf = Ghh=(G,fifl =(Gh)ft+ AGfe 
e fife = efi fre ° = (fie “)(e fae) = (e* fi)“ fa) 


and similarly for products of more functions. 


The adjoint representation of coordinate transformations is a “vector field” (in the 
sense of a spatial vector), a function that has general dependence on the coordinates 


(like a scalar field) but is also linear in the momenta (as are the Poincaré generators): 
GH" wn, VSV" elon = V=(6 VO OV" =v "ean On 


= OV" =O," =V"O,A" 


The same result follows if we use the Poisson bracket instead of the quantum me- 


chanical commutator, replacing O,, with ip,, in both G and V. 


Finite transformations can also be expressed in terms of transformed coordinates 


themselves, instead of the transformation parameter: 
(a) = "Pn g!(0) = g(a) 


as seen, for example, from a Taylor expansion of ¢’, using e~°d’ = e~%¢/e®. We then 
define 
Ha’) =4(0) af eng 


This is essentially the statement that the active and passive transformations cancel. 
However, in general this method of defining coordinate transformations is not con- 
venient for applications: When we make a coordinate transformation, we want to 
know ¢'(x). Working with the “inverse” transformation on the coordinates, i.e., our 
original e*®, 

etme => b(a) = €6(2) = $(F(2)) 


x 
So, for finite transformations, we work directly in terms of (2), and simply plug this 


into @ in place of x (a — £()) to find @’ as a function of x. 
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Similar remarks apply for the vector, and for derivatives in general. We then use 
g=e°%o = =e "Oe" 
where 0’ = 0/02’, since O’x! = Ox = 6. This tells us 
Vn), =e °V ™"(@0.2° =V (2 \d, 


or V’(a2’) = V(x). Acting with both sides on 2’, 


Ox'™ 


Ox” 


On the other hand, working in terms of £ is again more convenient: Changing the 


Vr) = (a) 


transformation for the vector operator in the same way as the scalar 


= Vand, =V" Zen 
Ox™ 


=> Va)eV"a) aan 


where 0 = 0/0% and as usual 


emiemiar — “2 — (22) 7 


on™ 


We can also use 
ViGg)=eViaje @ 


=> V!™(2)Om = (CCV (x)e~%) (CF One 9) = V"(Z) On 


A “differential form” is defined as an infinitesimal W = dz™W,,(x). Its transfor- 
mation law under coordinate transformations, like that of scalar and vector fields, is 
defined by W’(x’) = W(a). For any vector field V = V™(x)0,,, V"W,,, transforms as 


a scalar, as follows from the “chain rule” d = da’"0!, = dz™0,,. Explicitly, 


Wi, (a!) = Wal) 


or in infinitesimal form 


OW =A OW Waetek 


Thus a differential form is dual to a vector, at least as far as the matrix part of coor- 
dinate transformations is concerned. They transform the same way under rotations, 


because rotations are orthogonal; however, more generally they transform differently, 
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and in the absence of a metric there is not even a way to relate the two by raising or 


lowering indices. 


Higher-rank differential forms can be defined by antisymmetric products of the 
above “one-forms”. These are useful for integration: Just as the line integral [W = 
J dz™W,, is invariant under coordinate transformations by definition (as long as we 
choose the curve along which the integral is performed in a coordinate-independent 
way), so is a totally antisymmetric Nth-rank tensor (“N-form”) Wyn,...m, integrated 


on an N-dimensional subspace as 


Ox? OxPN 
pam seed WG eee 2 Ww; es Wo, -py(2)——— ee 


mi MN Oxr'™ Ox!™N 


where the surface element dx™ ---dx™ is interpreted as antisymmetric. (The signs 
come from switching initial and final limits of integration, as prescribed by the “ori- 
entation” of the hypersurface.) This is clear if we rewrite the integral more explicitly 


in terms of coordinates o* for the subspace: Then 
| dig da” Wray. (2) = / do" do! Wiyuiy (0) = / dNo YW, in (0) 


where 


= 3a 0 Wenn (2) 
is the result of a coordinate transformation that converts N of the x’s to o’s, an 
interpretation of the functions x(c) that define the surface. Then any coordinate 
transformation on x — x’ (not on a) will leave W(o) invariant. In particular, if the 
subspace is the full space, so we can look directly at f[d%a €™°™’ Win, my, We See 
that a coordinate transformation generates from W an N-dimensional determinant 


exactly canceling the Jacobian resulting from changing the integration measure dx. 


Exercise [C2.1 
For all of the following, use the exponential form of the finite coordinate 
transformation: 

a Show that any (local) function of a scalar field (without explicit x dependence 
additional to that in the field) is also a scalar field (i.e., satisfies the same 
coordinate transformation law). 

b Show that the transformation law of a vector field or differential form remains 
the same when multiplied by a scalar field (at the same 2). 

c Show that Vé = V"O,n¢ is a scalar field for any scalar field ¢ and vector field 
¥, 


d Show that [V,W] is a vector field for any vector fields V and W. 
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Exercise IC2.2 
Examine finite coordinate transformations for integrals of differential forms 
in terms of “ rather than x’. Find the explicit expression for W’(a) in terms 


of W(z(x)), etc., and use this to show invariance: 


mi MN 


pum dz ™NWw! (z) = pam +0 dE™§ Wing .-my (2) 


= 7 a Sd dial || Semreren 


where in the last step we have simply substituted  — x as a change of integra- 
tion variables. Note that, using the 7 form of the transformation rather than 
x’, the transformation generates the needed Jacobian, rather than canceling 


one. 


From the above transformation law, we see that the curl of a differential form is 
also a differential form: 


Ohne Vi (2!) = Bima, (Ong B??) + + (On y)@?™) Woop (&) 


[my "" m2--my] 
— [Op Woop] (2)|(O., a") med (0,0) 


because the curl kills 0’0’x terms that would appear if there were no antisymmetriza- 
tion. Objects that transform “covariantly” under coordinate transformations, without 
such higher derivatives of x (or A in the other notation), like scalars, vectors, differen- 
tial forms and their products, are called (coordinate) “tensors”. Getting derivatives of 
tensors to come out covariant in general requires special fields, and will be discussed 
in chapter IX. An important application of the covariance of the curl of differential 
forms is the generalized Stokes’ theorem (which includes the usual Stokes’ theorem 


and Gauss’ law as special cases): 


pam = «deg N+ oe hors Werna--rr 4 = pau” LY eis cocpgy 


where the second integral is over the boundary of the space over which the first is 
integrated. (We use the symbol “f” to refer to boundary integrals, including those 
over contours, which are closed boundaries of 2D surfaces.) It is basically just the 
fundamental theorem of (integral) calculus ( de dx f'(x) = f(b) — f(a)), as is clear 
from choosing a coordinate system where the boundary is at a fixed value of just one 
coordinate (at least in patches). (A standard example is a pair of infinite constant- 


time surfaces, neglecting the boundaries that connect them at spatial infinity.) 
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3. Young tableaux 


We now return to our discussion of finite-dimensional representations. In the 
previous section we gave the machinery for describing them using index notation, 
but examined only the defining representation in detail. Now we analyze general 


irreducible representations. 


All the irreducible finite-dimensional representations of the groups SU(N) can be 
described by tensors with lower N-valued indices with various (anti)symmetrizations. 
(An upper index can be replaced with N—1 lower indices by using the Levi-Civita 
tensor.) Although detailed calculations require explicit use of these indices, three 


properties can be more conveniently discussed pictorially: 
(1) the (anti)symmetries of the indices, 
(2) the dimension (number of independent components) of the representation, and 


(3) the reduction of the direct product of two representations (which irreducible rep- 


resentations result, and how many of each). 


A “Young tableau” is a picture representing an irreducible representation in terms 
of boxes arranged in a regular grid into rows and columns, such that the columns are 


aligned at the top, and their depths are nonincreasing to the right: for example, 


cial 


Each box represents an index, with antisymmetry among indices in any column, and 
symmetry among indices in any row. More precisely, since one can’t simultaneously 
have these symmetries and antisymmetries, it corresponds to the result of taking any 
arbitrary tensor with that many indices, first symmetrizing the indices in each row, 
and then antisymmetrizing the indices in each column (or vice versa; symmetrizing 
and then antisymmetrizing and then symmetrizing again gives the same result as 
skipping the first symmetrization, etc.). This gives a simple way to classify and 
symbolize each representation. (We can denote the singlet representation, which has 
no boxes, by a dot.) Note that the deepest column should have no more than N—1 


boxes for SU(N) because of the antisymmetry. 


To calculate the dimension of the representation for a given tableau, we use the 


“factors over hooks” rule: 


(1) Write an “N” in the box in the upper-left corner, and fill the rest of the boxes 
with numbers that decrease by 1 for each step down and increase by 1 for each 


step to the right. 


100 I. GLOBAL 


(2) Draw (or picture in your mind) a “hook” for each box — a “I’” with its corner 


in the box and lines extending right and down out of the tableau. 


(3) The dimension is then given by the formula 


integer written there 


dimension = ———————— 
II # boxes intersected by its hook 
each box 


For the previous example, we find (listing boxes first down and then to the right) 


N N-1 N-2 N-3 N+1 N N-1 N+2 N41 N+3 N4+2 N+4 


The direct product of two Young tableaux A®B is analyzed by the following rules: 
First, label all the boxes in B by putting an “a” in each box in the top row, “b” in 
the second row, etc. Then, take the following steps in all possible ways to find the 


Young tableaux resulting from the direct product: 


(1) Add all the “a” boxes from B to the right side and bottom of A, then “b” to 
the right and bottom of that, etc., to make a new Young tableaux. Any two 
tableaux constructed in this way with the same arrangement of boxes but different 
assignment of letters are considered distinct, i.e., multiple occurences of the same 
representation in the direct product. 

(2) No more than 1 “a” can be in any column, and similarly for the other letters. 

(3) Reading from right to left, and then from top to bottom (i.e., like Hebrew/Arabic), 


the number of a’s read should always be > the number of b’s, b’s > c’s, etc. 


De hf =f okie KP 


Note that A®B always gives the same result as B@A, but one way may be simpler 


For example, 


than the other. For a given value of N, a column of N boxes is equivalent to none 
(again by antisymmetry), while more than N boxes in a column gives a vanishing 
tableau. 

Exercise IC3.1 


Calculate 

L3H 
Check the result by finding the dimensions of all the representations and 
adding them up. 


These SU(N) tableaux also apply to SL(N): Only the reality properties are dif- 
ferent. Similar methods can be applied to USp(2N) (or Sp(2N)), but tracelessness 
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(with respect to the symplectic metric) must be imposed in antisymmetrized indices, 
so these trace pieces must be separated out when considering the above rules. (Le., 
consider USp(2N)CSU(2N).) Similar remarks apply to SO(N), which has a symmet- 
ric metric, but there are also “spinor” representations (see below). The additional 
irreducible representations then can be constructed from taking direct products of 
the above with the smallest spinors, and removing the “gamma-matrix” traces. Fur- 
thermore, using the Levi-Civita tensor, all columns can be reduced to no more than 
N/2 in height. 


4. Color and flavor 


We now consider the application of these methods to “internal symmetries” (those 
that don’t act on the coordinates) in particle physics. The symmetries with experi- 
mental confirmation involve only the unitary groups (U and SU) of small dimension. 
However, we will find later that larger unitary groups can be useful for approxima- 
tion schemes. (Also, larger unitary and other groups continue to be investigated for 


unification and other purposes, which we consider in later chapters.) 


The “Standard Model” describes all of particle physics that is well confirmed 
experimentally (except gravity, which is not understood at the quantum level). It 


includes as its “fundamental” particles: 


(1) the spin-1/2 quarks that make up the observed strongly interacting particles, but 
do not exist as asymptotic states, 

(2) the weakly interacting spin-1/2 leptons, 

(3) the spin-1 gluons that bind the quarks together, which couple to the charges 
associated with SU(3) “color” symmetry, but also are not asymptotic, 

(4) the spin-1 particles that mediate the weak and electromagnetic interactions, which 


couple to SU(2)@U(1) “flavor”, and 


(5) the yet unobserved spin-0 Higgs particles that are responsible for all the masses 


of these weakly interacting particles. 


(However, quarks and gluons are temporarily free at high energy, eventually recom- 
bining to give rise to “jets”, clusters of resulting hadrons.) These particles, along 


with their masses (in GeV) and (electromagnetic) charges (Q = Q + AQ), are: 
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lepton (1) ieee > oa oe 
rey savor (2) | (Q = —3) flavor ( 


Jone 0 ly (< 6 - 10-28) | 
(< 3210-7 Z (91.188) 


aan W (80.42) 
ae 00019) 


r (1.7770) 
Vv, (<.0182) 


—_—_ an 
i 
+3 
at 
i 
+3 
at 
i 
+3 


The quark masses we have listed are the “current quark masses”, the effective masses 
when the quarks are relativistic with respect to their hadron (at least for the lighter 
quarks), and act as almost free. But since they are not free, their masses are ambigu- 
ous and energy dependent, and defined by some convenient conventions. Nonrelativis- 
tic quark models use instead the “constituent quark masses”, which include potential 
energy from the gluons. This extra potential energy is about .30 GeV per quark in 
the lightest mesons, .35 GeV in the lightest baryons; there is also a contribution to 
the binding energy from spin-spin interaction. Unlike electrodynamics, where the po- 
tential energy is negative because the electrons are free at large distances, where the 
potential levels off (the top of the “well” ), in chromodynamics the potential energy is 
positive because the quarks are free at high energies (short distances, the bottom of 
the well), and the potential is infinitely rising. Masslessness of the gluons is implied 
by the fact that no colorful asymptotic states have ever been observed. We have 
divided the spin-1/2 particles into 3 “families” with the same quantum numbers (but 
different masses). Within each family, the quarks are similar to the leptons, except 
that: 
(1) the masses and average charges (Q) are different, 
(2) the quarks come in 3 colors, while the leptons are colorless, and 
(3) the neutrinos, to within experimental error, are massless, so they have half as 
many components as the massive fermions (1 helicity state each, instead of 2 spin 
states each). 
This means that each lepton family has 1 SU(2) doublet and 1 SU(2) singlet. For 
symmetry (and better, quantum mechanical, reasons to be explained later), we also 


assume the quarks have 1 SU(2) doublet, but therefore 2 SU(2) singlets. (Some exper- 


iments have indicated small masses for neutrinos: This would require generalization 
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of the Standard Model, such as models with parity broken by interactions. Some 


examples of such theories will be discussed in subsection IVB4.) 


We first look at the color group theory of the physical states, which are color 
singlets. The fundamental unobserved particles are the spin-1 “gluons”, described by 
the Yang-Mills gauge fields, and the spin-1/2 quarks. Suppressing all but color indices, 
we denote the quark states by q’, and the antiquarks by q';, where the indices are those 
of the defining representation of SU(n), and its complex conjugate. The quarks also 
carry a representation of a “flavor” group, unlike the gluons. The simplest flavorful 
states are those made up of only (anti)quarks, with indices completely contracted 
by one factor of an SU(n) group metric: From the “U” of SU(n), we can contract 
defining indices with their complex conjugates, giving the “mesons”, described by 
qq’ (quark-antiquark), which are their own antiparticles. From the “S” of SU(n), 
we have the “baryons”, described by ¢;,..;,q"'...¢’ (n-quark), and the antibaryons, 
described by the complex conjugate fields. All other colorless states made of just 
(anti)quarks can be written as products of these fields, and therefore considered as 
describing composites of them. Thus, we can approximate the ground states of the 
mesons by 

q'i(x)q' (a) 
which describe spins 0 and 1 because of the various combinations of spins (from 
+@4=061). The first excited level will then be described by 


g'0q= q'0q- (0q')q 


The antisymmetric derivative picks out the relative momentum of the two quarks, 
rather than the total, and thus introduces orbital angular momentum 1 (and simi- 
larly for more such derivatives). This level thus includes spins 0, 1, and 2. (Similar 
remarks apply to baryons.) We can also have flavorless states made from just gluons, 
called “glueballs”: The ground states can be described by F;/F;’, where each F is a 
gluon state (in the adjoint representation of SU(n)), and includes spins 0 and 2 (from 
the symmetric part of 1 @ 1). Because of their flavor multiplets and (electroweak) 
interactions, many mesons and baryons corresponding to such ground and excited 
states have been experimentally identified, while the glueballs’ existence is still un- 
certain. Actually, quarks and gluons can almost be observed independently at high 
energies, where the “strong” interaction is weak: The energetic particle appears as 
a “jet” — a particle of high energy accompanied by particles of much lower energy 
(perhaps too small to detect) in color-singlet combinations. (Depending on the avail- 
able decay modes, the jet might not be observed until after decaying, but still within 


a small angle of spread.) 
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Just as all physical states are singlets of the local color SU(3), they are also singlets 
of the local SU(2) of electroweak interactions. As will be explained later (subsection 
IVB2), there are four Higgs fields, which transform simultaneously as a doublet of 
this local SU(2) and a doublet of a broken, global isospin SU(2). (The determinant 
of this 2x2 matrix gives the observable singlet Higgs.) For example, the proton and 
neutron, which have close but different mass, are a doublet of this global SU(2). 
Unlike the confinement responsible for SU(3) singlets, which is nonperturbative, the 
Higgs mechanism responsible for SU(2) singlets is perturbative, since the scalar Higgs 
fields can be expanded about their “vacuum values”, which are just numbers: SU(2) 
singlets can be found from multiplying general fields by Higgs scalars, which trade the 
local SU(2) for the global one, while there are no scalars that transform under SU(3), 
and giving a vacuum value to a field with spin would violate Lorentz invariance. 
Ironically, while the Higgs is easy to describe theoretically, but hasn’t been found yet 
experimentally, confinement is the opposite. However, they look similar: Both have 
(lowest mass) composite scalars of the form Ww and vectors of the form WTiV aw that 
are singlets under their (nonabelian) gauge group, where w is a scalar field for Higgs 
and a spinor (fermion) field for confinement. Classically they seem quite different, but 


the quantum relation is still unclear. Supersymmetry might provide some relation. 


We now look at the flavor group theory of the physical hadronic states. In contrast 
to the previous paragraph, we now suppress all but the flavor indices. Mesons M;/ = 
qq) are thus in the adjoint representation of flavor U(m) (m @ m, where m is the 
defining representation and m its complex conjugate), for both the spin-0 and the 
spin-1 ground states. The baryons are more complicated: For simplicity we consider 
SU(3) color, which accurately describes physics at observed energies. Then the color 
structure described above results in total symmetry in combined flavor and Lorentz 
indices (from the antisymmetry in the color indices, and the overall antisymmetry for 


Fermi-Dirac statistics). Thus, for the 3-quark baryons, the Young tableaux 


HefPenn 


for SU(m) flavor are accompanied by the same Young tableaux for spin indices: In 
nonrelativistic notation, the first tableau, being totally antisymmetric in flavor in- 
dices, is also totally antisymmetric in the three two-valued spinor indices, and thus 
vanishes. Similarly, the last tableau describes spin 3/2 (total symmetry in both types 
of indices), while the middle one describes spin 1/2. Since only 3 flavors of quarks 
have small masses compared to the hadronic mass scale, hadrons can be most conve- 


niently grouped into flavor multiplets for SU(3) flavor: The ground states are then, 
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in terms of SU(3) flavor multiplets, 81 for the pseudoscalars, 861 for the vectors, 8 
for spin 1/2, and 10 for spin 3/2. 


Exercise IC4.1 
What SU (flavor) Young tableaux, corresponding to what spins, would we have 


for mesons and baryons if there were 
a 2 colors? 
b 4 colors? 


However, the differing masses of the different flavors of quarks break the SU(3) 
flavor symmetry (as does the weak interaction). In particular, the mass eigenstates 
tend to be pure states of the various combinations of the different flavors of quarks, 
rather than the linear combinations expected from the flavor symmetry. Specifically, 
the linear combinations predicted by an 8@1 separation for mesons (trace and traceless 
pieces of a 3®3 matrix) are replaced with particles that are more accurately described 
by a particular flavor of quark bound to a particular flavor of antiquark. (This is 
known as “ideal mixing”.) The one exception is the lighest mesons (pseudoscalars), 
which are more accurately described by the 81 split, for this restriction to the 3 
lighter flavors of quarks, but the mass of the singlet differs from that naively expected 
from group theory or nonrelativistic quark models. (This is known as the “U(1) 
problem”.) The solution is probably that the singlet mixes strongly with the lightest 
psuedoscalar glueball (described by tr Salas 8 | the mass eigenstates are linear 
combinations of these two fields with the same quantum numbers. In any case, the 
most convenient notation for labeling the entries of the matrix M;/ representing the 
various meson states for any particular spin and angular momentum of the quark- 
antiquark combination is that corresponding to the choice we gave earlier for the 
generators of U(n): Label each entry by a separate name, where the complex conjugate 
appears reflected across the diagonal. These directly correspond to the combination 
of a particular quark with a particular antiquark, and to the mass eigenstates, with 
the possible exception of the entries along the diagonal for the 3 lightest flavors, 
where the mass eigenstates are various linear combinations. (However, the SU(2) of 
the 2 lightest flavors is only slightly broken by the quark masses, so in that case the 
combinations are very close to the 361 split of SU(2).) 


For example, for the lightest multiplet of mesons (spin 0, and relative angular mo- 
mentum 0 for the quark and antiquark, but not all of which have yet been observed), 


we can write the U(6) matrix (for the 6 flavors of the 3 known families) 
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dd du ds dec db dt 


ud wu us uc ub wut 


8d su 8s 8c 8b S&8t 


M, = 

cd Ci Cs. €¢- cD Ci 

bd bu bs be bb bt 

td tu ts tec tb tt 
d u 8 c b t 
d {1a m+(.1395702) K°(.49765) Dt(1.8693) B°(5.2794) Tt 
alm (") tM K~(.49368) D°(1.8645) B-(5.2790) T° 
_ | KX) Kt") Ns D+(1.9682) B°(5.370) T+ 
© 56.) DH) DO) DY) ne(2.980)  B>(6.4) 7? 
Beet) Bey) Be) Be) Mb Ty 
a oa qe si To i m 


where (approximately) 


Ty = ygt(.1349766) + 5[n'(-9578) + n(.5478)] 


Na=— yar + 3(n' +7), Ns = Yg(n' —n) 
in terms of the mass eigenstates (observed particles), with masses again in GeV, 


and ditto marks refer to the transposed entry. (We have neglected the important 


contribution from the glueball.) For the corresponding spin-1 multiplet, 


d U 8 Cc b t 
d [wa prcripp) "|. 8961). **(2:0100), 2 *(6.2250) 2 
al p(") wy K*-(.8917) D*°(2.0067) B*-(5.3250) T*° 
agi 5 | KC") Ke) (1.01946) D*F(2.1120) B*9(5.417) T*T 
“ @ | D&(") D*(")  D¥E(") J/1(3.09692) B*; T*O 
b | BA) BY*(") Be") B*. Y(9.4603) 9 T*Y 
a ae Teo jin 7 is 0 
where 


Wu = 75 lw(-7826) + p"(.7758)], wa = Fa(w — p”) 


(with ss = ¢, ideal mixing, also approximate). 
Exercise [C4.2 
Check the consistency of the masses in the second mass matrix above by as- 
suming the meson masses are just the sum of the “constituent” quark masses: 


See how close a fit you can get. (Potential energies have just been lumped 
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into the quark masses, assuming they are the same throughout the multiplet. 
Note that masses on the diagonal will come out a bit low from annihilation 
effects. The first multiplet was not used because of complications from mix- 
ing with glueballs. Similar mass relations can be obtained from group theory 


arguments, but the underlying physics is explained by the quark model.) 


5. Covering groups 


The orthogonal groups O(n;,n_) are of obvious interest for describing Lorentz 
symmetry in spacetimes with n, space and n_ time dimensions, or conformal sym- 
metry in spacetimes with ny—1 space and n_—1 time dimensions. This means we 
should be interested in O(n) for n<6, and their “Wick rotations”: transformations 
that put in extra factors of 7 to change some signs on the metric. Coincidentally, these 
are just the cases where the Lie algebras of the orthogonal groups are equivalent to 
those of some algebras for smaller matrices. The smaller representation then can be 
identified as the “spinor” representation of that orthogonal group. Since the “vec- 
tor”, or defining representation space of the orthogonal group, itself is represented as 
a matrix with respect to the other group (i.e., the state carries two spinor indices), 
the other group may include certain phase transformations (such as —1) that cancel 
in the transformation of the vector. The other group is then called the “covering” 
group for that orthogonal group, since it includes those missing transformations in 
its defining representation. (As a result, its group space also has a more interesting 


topology, which we won’t discuss here.) 


One way to discover these covering groups is to first count generators, then try 
to construct explicitly the orthogonal metric on matrices. SO(n) has n(n—1)/2 gen- 
erators (antisymmetric matrices), Sp(n) has n(n+1)/2 (symmetric), and SU(n) has 
n’—1 (traceless). (These are hermitian generators, since we applied reality or her- 
miticity.) So, for some group SO(n), we look for another group that has the same 
number of generators. Then, if the new group is defined on mxm matrices, we look 
for conditions to impose on an mxm matrix (not necessarily the adjoint) to get an 
n-component representation. This is easy to do by inspection for small n; for large n 
it’s easy to see that it can’t work, since m will be of the order of n, and the simple 


2 components instead of n. We then construct 


constraints will give of the order of n 
the norm of this matrix M as tr(M'M), which is just the sum of the absolute value 
squared of the components, for SO(n), and the other orthogonal groups by Wick 


rotation. (Wick rotation affects mainly the reality conditions on /.) 


108 I. GLOBAL 


The identifications for the Lie algebras are then: 


§0(2) = U(1), SO(1,1) = GL(1) 

$0(3) = SU(2) = SU*(2) = USp(2), $O(2,1) = SU(1,1) = SL(2) = Sp(2) 

$0(4) = SU(2)@SU(2), $0(3,1) = SL(2,C) = Sp(2,C), $0(2,2) = SL(2)@SL(2) 
( 


SO(5) = USp(4), SO(4,1) = USp(2,2), SO(8,2) = Sp(4) 

SO(6) = SU(4), SO(5,1) = SU*(4), $O(4,2) = SU(2,2), SO(38,3) = SL(4) 

Note that the Euclidean cases are all unitary, while the ones with (almost) equal 
numbers of space and time dimensions are all real. There are also some similar 


relations for the pseudoreal orthogonal groups: 
SO*(2) = U(1), + SO*(4) = SU(2)@SL(2), SO*(6) = SU(8,1), SO*(8) = SO(6,2) 
The norm and conditions for an m-spinor of SO(n;,n_) are: 
de 
rf _vorm —[eommetny == | vents = 
1 


PP Pe=a. 
=€E2E z 
=€E2E ZF z& 


2 a 
3 i] z Cee SG z 
4 aia Z7 Evya€ 5! 3! 
Bl] 2° 2Pesvag | —z (2°? Qga = 0) |] sez | Ze(V27) Z 
ZF 2 5 Be —Z FEZ —222 | se(V2V) | z 


Note that in all but the 2D cases the norms are associated with determinants: For 


D=3 and 4 the norm is given by the determinant, while for D=5 and 6 we use the 


fact that the determinant of an antisymmetric matrix is the square of the Pfaffian. 


Exercise IC5.1 
Show that for D=5 zze and zz give the same norm. (Hint: Consider 
2oa82-9}-) 

Unfortunately, for SO(n) for larger n, the spinor is as least as large as, and usu- 
ally larger than, the vector. In general, the spinor is like the “square root” of the 
vector, in that the vector can be found by taking the direct product of two spinors. 
It is impossible to find the spinor representation by taking direct products of vec- 
tors. This situation occurs only for orthogonal groups: In all other classical groups, 
all (finite-dimensional) representations are among those obtained from multiple di- 
rect products of vectors. Furthermore, in those cases the “irreducible” representa- 
tions (those that can’t be divided into smaller representations) can be picked out by 
(anti)symmetrization, and by separating trace and traceless pieces (where traces are 


taken with the group metrics). Fortunately, for the above cases of orthogonal groups, 
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we can perform the same construction starting with the spinor representations, since 


those are the “vectors” of non-orthogonal groups. 
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II. SPIN 


Special relativity is simply the statement that the laws of nature are symmetric 
under the Poincaré group. Free relativistic quantum mechanics or field theory is then 
equivalent to a study of the representations of the Poincaré group. Since the con- 
formal group is a classical group, while its subgroup the Poincaré group is not, it is 
easier to first study the conformal group, which is sufficient for finding the massless 
representations of the Poincaré group. The massive ones then can be found by di- 
mensional reduction, which gives them in the same form as occurs in interacting field 
theories. In four spacetime dimensions we use the covering group of the conformal 
group, which is the easiest way to include spinors. These methods extend straight- 
forwardly to supersymmetry, a symmetry between fermions and bosons that includes 


the Poincaré group. 


ieee seed A. TWO COMPONENTS ......0.0......... 


Although we have already specialized to spacetime symmetries, we have consid- 
ered arbitrary spacetime dimensions. We have also noted that many of the lower- 
dimensional Lie groups have special properties, especially with regard to covering 
groups. In this section we will take advantage of those features; specifically, we ex- 
amine the physical case D=4, where the rotation group is SO(3)=SU(2), the Lorentz 
group is SO(38,1)=SL(2,C), and the conformal group is SO(4,2)=SU(2,2). 


1. 3-vectors 


The most important nontrivial Lie group in physics is the rotations in three 
dimensions. It is also the simplest nontrivial example of a Lie group. This makes 
it the ideal example to illustrate the properties discussed in the previous chapter, 
as well as lay the groundwork for later discussions. We have already mentioned the 
orbital part of rotations, i.e., the representation of rotations on spatial coordinates. 
In this chapter we discuss the spin part; this is really the same as finding all (finite 


dimensional, unitary) representations. 


Since the earliest days of quantum mechanics, we know that half-integer spins 
exist, in nature as well as group theory, e.g., the electron and proton. This might be 
expected to complicate matters, but actually simplifies them, due to the well-known 
inequality 


ee 
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This means that a “spinor”, describing spin 1/2, has only 2 components, compared 
to the 3 components of a vector, and its matrices (e.g., for rotations) are thus only 
2x2 instead of 3x3. 


We first consider spinors in matrix notation, then generalize to “spinor notation” 
(spinor indices). The simplest way to understand why rotations can be represented 
as 2x2 instead of 3x3 is to see why 3-vectors themselves can be understood as 
2x2 matrices, which for some purposes is simpler. (This is equivalent to Hamilton’s 
“quaternions”, which predated Gibbs’ vector notation, and were used by Maxwell for 
his equations. This way also generalizes in a very simple way to relativity, in three 
space and one time dimensions.) Consider such matrices to be hermitian, which 
is natural from the quantum mechanical point of view. Then they have four real 
components, one too many for a three-vector (but just right for a relativistic four- 


vector), so we restrict them to also be traceless: 
V=V'. rV=0 


The simplest way to get a single number out of a matrix, besides taking the trace, is 
to take the determinant. By expanding a general matrix identity to quadratic order 
we find an identity for 2x2 matrices 


det(I+M)=e™ "4M 5 _2 det M = tr(M”’) — (tr M)? 


It is then clear that in our case —det V is positive definite, as well as quadratic, so 


we can define the norm of this 3-vector as 
|V|? = —2 det V = tr(V’) 
This can be compared easily with conventional notation by picking a basis: 


Vi tai) | Bees 
Vad vaya “yi )=¥-a => detV=-3(V’) 


where ¢ are the Pauli o matrices, up to normalization. As usual, the inner product 


follows from the norm: 
V+WwP=|VP + |W? +2V-W 
=> V-W=det V + det W —det(V+W) =tr(VW) 
Applying our previous identities for determinants to 2x2 matrices, we have 


MCM'C =I det M,  M~t=CM*C(det M)~' 
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where we now use the imaginary, hermitian matrix 


c=(7) 


i 0 
If we make the replacement M — e™ and expand to linear order in M, we find 
M+CM'C=ItrM 
This implies 
trV=0 #6 (VC) =VC 


i.e., the tracelessness of V is equivalent to the symmetry of VC. Furthermore, the 


combination of the trace and determinant identities tell us 
M’?=MtrM—-IdetM => V?=-—I detV =I3|V/ 


Here by “V2” we mean the square of the matrix, while “|V|?”= (V*)? is the square 


of the norm (neither of which should be confused with the component V? = V‘6?.) 


Again expressing the inner product in terms of the norm, we then find 
{VW} =V-W)t 


Also, since the commutator of two finite matrices is traceless, and picks up a minus 
sign under hermitian conjugation, we can define an outer product (vector xvector = 
vector) by 

[V, W] = V2iV x W 


Combining these two results, 


VW =5(V -W)l+ iV x W 


In other words, the product of two traceless hermitian 22 matrices gives a real trace 
piece, symmetric in the two matrices, plus an antihermitian traceless piece, antisym- 
metric in the two. Thus, we have a simple relation between the matrix product, the 
inner (“dot”) product and the outer (“cross”) product. Therefore, the cross product 


is a special case of the Lie bracket, or commutator. 


Exercise ITA1.1 


Check this result in two ways: 


a Show the normalization agrees with the usual outer product. Using only the 
above definition of V x W, along with {V,W} = (V-W)J, show 


AV sway WP =-ivp PW) 
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b Use components, with the above basis. 
Exercise ITA1.2 
Write an arbitrary two-dimensional vector in terms of a cor 
a Show that the phase (U(1)) transformation V’ = Ve generates the usual 


plex number as 


rotation. Show that for any two vectors V; and V2, V;*Vo is invariant, and 


identify its real and imaginary parts in terms of well known vector prod- 
ucts. What kind of transformation is V — V*, and how does it affect these 
products? 

b Consider two-dimensional functions in terms of z = ala + iy) and z* = 
<3(x — iy). Show by the chain rule that 0. = 4;(0, — iy). Write the real 
and imaginary parts of the equation 0,*V = 0 in terms of the divergence and 
curl. (Then V is a function of just z.) 


c Consider the complex integral 


where “$” is a “contour integral”: 
complex plane defined by parametrizing dz = du(dz/du) in terms of some 


real aed U. ae is useful if V can be saps eae as ahs - = 


integral and “surface” integral. (In two dimensions a surface element differs 
from a line element only by its direction.) Use this fact to solve Gauss’ law 
in two dimensions for a unit point charge as HE = 1/47z. 
Exercise ITA1.3 

Consider electromagnetism in 2x2 matrix notation: Define the field strength 
as a complex vector F = V/2(E + iB). Write partial derivatives as the sum 
of a (rotational) scalar plus a (3-)vector as 0 = 310, + V, where 0, = 0/0t 
is the time derivative and V is the partial space derivatives written as a 


traceless matrix. Do the same for the charge density p and (3-)current j as 
= = wales p 4 - —— = serene . om ame cross — in terms oe 


eusiliais oF = — hes ernie into its be sud pekene pieces, ana 
its hermitian and antihermitian pieces, gives the usual Maxwell equations 


V-B=0, V-E=p, Vx#£E+08B=0, Vx B-OF =] 
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(Note: Avoid the Pauli o-matrices and explicit components.) 


2. Rotations 


One convenience of representing three-vectors as 2x2 instead of 3x1 is that rota- 
tions are easier to write. Since vectors are hermitian, we expect their transformations 


to be unitary: 
Vi=uvUl, Ut=uU™ 


It is easily checked that this preserves the properties of these matrices: 
VY SUVEy=V"0 eV) =v ert UV) HV) =0 
Furthermore, it also preserves the norm (and thus the inner product): 
det(V’) = det(UVU~') = det(U)det(V) (det U)~' = det V 


Unitary 2x2 matrices have 4 parameters; however, we can elimimate one by the 
condition 
det U=1 

This eliminates only the phase factor in U, which cancels out in the transformation 
law anyway. Taking the product of two rotations now involves multiplying only 2x2 
matrices, and not 3x3 matrices. 

We can also write U in exponential notation, which is useful for going to the 
infinitesimal limit: 


U= 3s G=G, trG=0 


This means that G itself can be considered a vector. Rotations can be parametrized 
by a vector whose direction is the axis of rotation, and whose magnitude is (1//2x) 


the angle of rotation: 
Veeve => iVed6,V) —=2eC%V 
We also now see that the Lie bracket we previously identified as the cross product is 


the bracket for the rotation group. 
Exercise ITA2.1 


Evaluate the elements of the matrix e’@ in closed form for a diagonal generator 


G. Generalize this result to arbitrary G. (Hint: Use rotational invariance.) 


The hermiticity condition on V can also be expressed as a reality condition: 


V=V!' and tV=0 => V*=-CVC, (VC)*¥=C(VVOC 
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“KO” 


where is the usual complex conjugate. A similar condition for U is 


U't=U! and detU=1 3 U*=CUC 


which is also a consequence of the fact that we can write U in terms of a vector as 
U =e’. Asa result, the transformation law for the vector can be written in terms 


of VC in a simple way, which manifestly preserves its symmetry: 
(VCY=UVU“CaUVvout 


Exercise ITA2.2 


We return to our example of D=2: 


a Write an arbitrary rotation in two dimensions in terms of the slope (dy/dx) 
of the rotation (the slope to which the x-axis is rotated) rather than the 
angle. (This is actually more convenient to measure if you happen to have 
a ruler, which you need to measure lengths anyway, but not a protractor.) 
This avoids trigonometry, but introduces ugly square roots: Compare Lorentz 
transformations. Also note that this square root form covers only half of the 
available angles. 

b Show that the square roots can be eliminated by using the slope of half the 
angle of transformation as the variable. Show the relation to the variables 
used in writing 3D rotations in terms of 2x2 matrices, i.e., the use of complex 


variables, as in exercise I[A1.2a. (Hint: Consider U and VC diagonal.) 


3. Spinors 


Note that the mapping of SU(2) to SO(3) is two-to-one: This follows from the 
fact V’ = V when U is a phase factor. We eliminated continuous phase factors from 
U by the condition det U = 1, which restricts U(2) to SU(2). However, 


det(Ie”) =e" -1 > e¥=41 


for 2x2 matrices. More generally, for any SU(2) element U, —U is also an element of 
SU(2), but acts the same way on a vector; i.e., these two SU(2) transformations give 
the same SO(3) transformation. Thus SU(2) is called a “double covering” of SO(3). 
However, this second transformation is not redundant, because it acts differently on 


half-integral spins, which we discuss in the following subsections. 
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The other convenience of using 2x2 matrices is that it makes obvious how to 
introduce spinors — Since a vector already transforms with two factors of U, we 


define a “square root” of a vector that transforms with just one U: 
w=Up => pl =ylut 


where w is a two-component “vector”, i.e., a 2X1 matrix. The complex conjugate of 


a spinor then transforms in essentially the same way: 
(Cy) = CU*Y* = U(CYy*) 


Note that the antisymmetry of C’ implies that ~ must be complex: We might think 
that, since Cw* transforms in the same way as w, we can identify the two consistently 


with the transformation law. But then we would have 
y= Cy* = C(CYy*)* = CC*y = - 


Thus the representation is pseudoreal. The fact that Cw* transforms the same way 


under rotations as w leads us to consider the transformation 
Ul = Cyr 


Since a vector transforms the same way under rotations as ~y', under this transfor- 
mation we have 


V' =CV*C =-V 
which identifies it as a reflection. 


Another useful way to write rotations on w (like looking at VC instead of V) is 
(pTCY = (p*C)U™ 
This tells us how to take an invariant inner product of spinors: 


y=Uy, x =Ux => (pb Cx) = (be Cy) 


In other words, C’ is the “metric” in the space of spinors. An important difference of 
this inner product from the familiar one for three-vectors is that it is antisymmetric. 


Thus, if ~ and y are anticommuting spinors, 
wrx = —x7 CT py = xT Cp 


where one minus sign comes from anticommutativity and another is from the anti- 


symmetry of C’. Thus, it makes sense to take the norm of an anticommuting spinor 
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as W? Cw, which would vanish if 7 were commuting. Of course, since rotations are 


unitary, we also have the usual yy as an invariant, positive definite, inner product. 


Exercise ITA3.1 
Consider a hermitian but not traceless 2x2 matrix M (M = M1, tr M #0). 


a Show 
dt M=0 => M=4ve@u' 


for some commuting spinor (column vector) ~ (and some sign +). 


b Define a vector by 
V = V2(M — 4I tr M) 


Show |V| (not |V|?) is simply tw. 


4. Indices 


The best way to discuss general spins is to use index notation, rather than matrix 


notation. Then a spinor rotates as 


We = Ua We 


with two-valued indices 


The inner product is defined by 


V°Xa = b*Cpax” — —Wax” 


where we have defined raising and lowering of indices by 


Ya =W'Coa, YP = CM rbg 


paying careful attention to signs. (In general, we fix signs by using a convention of 
contracting indices from upper-left to lower-right.) Then objects with many indices 


transform as the product of spinors: 


= U.'Ug*...Uy' Aée..¢ 


! 
a... 


An infinitesimal transformation is then a sum: 


—i0Agg...y = Ga’ Asp. oF Gia? Aas... = ei G,? Aap...6 
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This is also true for Cag, even though it is an invariant constant: 


C! 


B= UU? Cy = Cap det U = Cop 
A more interesting case is the vector: The transformation law is 


V! 


g =U Us'V5 


where Vg is the symmetric VC considered earlier (in contrast to the antisymmetric 
Ch. 


There is basically only one identity in index notation, namely 
0= $C loaCry5 = CapCys + Cp4Cas + CyaC gs 


The expression vanishes because it is antisymmetric in those indices, and thus the 
indices must all have different values, but there are three two-valued indices. Another 


way to write this identity is to use the definition of C° as the inverse of Cyg: 
By — §6 5 sy = 5 156 
Cae =o. = OygC = Oa] = 010g = 0204 


This tells us that antisymmetrizing in any pair of indices automatically contracts 


(sums over) them: Contracting this identity with an arbitrary tensor A,5, 
Alas] = Cap” Ays = —CapA”, 


That means that we need to consider only objects that are totally symmetric in their 
free indices. This gives all spins: Such a field with 2s indices describes spin s; we have 


already seen spins 0, 1/2, and 1. 


We have defined the transformation law of all fields with lower indices by con- 
sidering the direct product of spinors. Transformations for upper indices follow from 


multiplication with C°%: They all follow from 
wie = PU") 69 


Since the vertical position of the index indicates the form of the transformation law, 


we define 

Wa = Qb*)* 
where the “ ” indicates complex conjugation. Thus, a hermitian matrix is written 
as 


M. = (M") 4? = (M,")* = Ml = Mag = M60 
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So, for a vector we have 


Vos = Vogp = Vea 


Spin s is usually formulated in terms of a (2s+1)-component “vector”. Then 
one needs to calculate Clebsch-Gordan-Wigner coefficients to construct Hamiltonians 
relating different spins. For example, to couple two spin-1/2 objects to a spin-1 object, 
one might write something like V - wléy. The matrix elements of the Pauli matrices 
ao are the CGW coefficients for the spin-1 piece of $ ® $ = 100. This method gets 
progressively messier for higher spins. On the other hand, in spinor notation such a 
term would be simply V°%7).g; no special coefficients are necessary, only contraction 
of indices. Similarly the decomposition of products of spins involves only the picking 


out of the various symmetric and antisymmetric pieces: For example, for 5 ® s, 


WaXB as 5(WaXxXe) a WaX.B}) = 5U(aX8) _ Copy’ xy = Vas = Caps 


where (a) means to symmetrize in those indices, by adding all permutations with 
plus signs. We have thus explicitly separated out the spin-1 and spin-0 parts V and 
S of the product. The square roots of various integers that appear in the CGW 
coefficients come from permutation factors that appear in the normalizations of the 


various fields/wave functions that appear in the products: For example, 
A” Aubry _ |\APee/? ah. 3| AP®e | cs 3| A®ee |? ats | AP? |? 


In the spinor index method, the square roots never appear explicitly, only their squares 
appear in normalizations: For example, in calculating a probability for A ® B —- C, 
we evaluate 
(A ® BIC)(C|A ® B) 
(A|A)(B|B)(C|C) 

where A, B, and C' each have 2s indices for spin s, and (|) means contracting all 
indices (with the usual complex conjugation). (Normalizing states to other than 1 is 
often convenient and sometimes necessary: For example, plane waves are normalized 


with 6 functions.) 


Exercise IJA4.1 
Redo exercise ITA3.1 in index notation: For xg above (both now bosonic), 
show V°V,g = —2S?. 
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5. Lorentz 


Consider now a 2x2 matrix, whose elements we label as 


. yot yee vt yex 
Vires : a 
yoo yee vi V- 


=A(jo ly V? +iv8 


veya ivs ye-yl 


which we choose to be hermitian, 


Javea 


Vavt = ve =(viye = (ye 


where we distinguish the right spinor index by a dot because it will be chosen to 
transform differently from the left one: According to our discussion of subsection 
IB5, this is the general labeling consistent with hermiticity, ie., V’ = gVgt (but 
without the extra restriction of group unitarity of the previous subsections). For 
comparison, lowering both spinor indices with the matrix C’ as for SU(2), and the 
vector indices with the Minkowski metric (in either the orthonormal or null basis, as 
appropriate — see subsection IA4), we find another hermitian matrix 


Ma-(% caer aes Y= 8) von 


AY Vi) BAwB+iIM WV ap 


In the orthonormal basis, ao, are the Pauli matrices and the identity, up to normal- 
ization. They are also the Clebsch-Gordan-Wigner coefficients for spinor®spinor = 
vector. In the null basis, they are completely trivial: 1 for one element, 0 for the rest, 
the usual basis for matrices. In other words, they are simply an arbitrary way (ac- 
cording to choice of basis) to translate a 2x2 (hermitian) matrix into a 4-component 


vector. We will sometimes treat a vector index “a” as an abbreviation for a spinor 


index pair “ad”: 
Ve=V", a=ad = (66,06, 06,88) = (+,4,t,-) 
where a and @ are understood to be independent indices (6 4 4, etc.). 


Examining the determinant of (either version of) V, we find the correct Minkowski 


norms: 
ae V BV V OV ear ey ye raya 


Thus Lorentz transformations will be those that preserve the hermiticity of this matrix 


and leave its determinant invariant: 


V'=Q9Vg', detg=1 
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(det g could also have a phase, but that would cancel in the transformation.) Thus g 


is an element of SL(2,C). In terms of the representation of the Lie algebra, 
g=e°, trG=0 
Thus the group space is 6-dimensional (G has three independent complex compo- 
nents), the same as SO(3,1) (where gng? = 7 > (Gn)? = —Gn). 
Exercise IIA5.1 


SL(2,C) also can be seen (less conveniently) from vector notation: 


a Consider the generators 


- = $(Jap ote id €abead“) 


a 


of SO(3,1). Find their commutation relations, and in particular show 
[J, J] = 0. Express I? in terms of a. Show Da have the same 


commutation relations as J;;. Finally, take a general infinitesimal Lorentz 
E) 


transformation in terms of J,, and rewrite it in terms of Ba 
attention to the reality properties of the coefficients. This demonstrates that 
the algebra of SO(3,1) is the same as that of SU(2)@SU(2), but Wick rotated 


to SL(2,C). 


, paying special 


b Apply the same procedure to SO(4) and SO(2,2) to derive their covering 


groups. 


Exercise ITA5.2 


Consider relativity in two dimensions (one space, one time): 


a Show that SO(1,1) is represented in lightcone coordinates by 
gf? as Ag, ae a a 


for some (nonvanishing) real number A, and therefore SO(1,1) = GL(1). Write 
this one Lorentz transformation, in analogy to exercise I[A1.2a on rotations 
in two space dimensions, in terms of an analog of the angle (“rapidity”) for 
those transformations that can be obtained continuously from the identity. 


Do the relativistic analog of exercise ITA2.2. 


b Still using lightcone coordinates, find the parity and time reversal transfor- 
mations. Note that writing A as an exponential, so it can be obtained con- 
tinuously from the identity, restricts it to be positive, yielding a subgroup of 
GL(1). Explicitly, what are the transformations of O(1,1) missing from this 
subgroup? Which of P, (C)T, and (C)PT are missing from these transforma- 
tions, and which are missing from GL(1) itself? 
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In index notation, we write for this vector 


toy ge br, 
Vig 90 9B Vy 


while for a (“Weyl”) spinor we have 
vy = Ga bp 


The metric of the group SL(2,C) is the two-index antisymmetric symbol, which is 


also the metric for Sp(2,C): In our conventions, 


We also have the identities 
dt Lf = lO Calg ge St lyase), Ve i SO" C alg (det) 


Alas] = CapC” Ays, Ajagy = 9 


discussed earlier in this section. As there, we use the metric to raise, lower, and 


contract indices: 


Wa =WCpar Va= WC, 


V-W=Vv"w, 5 


These results for SO(3,1) = SL(2,C) generalize to SO(4) = SU(2)@SU(2) (relevant 
to the Standard Model: see subsection IVB2) and SO(2,2) = SL(2)@SL(2). As 


described earlier, the reality conditions change, so now 
SO): (V°")*=VPC,.Crp,  SO(2,2):  (V2')* =v 


consistent with the (pseudo)reality properties of spinors for SU(2) and SL(2), where 
we now use unprimed and primed indices for the two independent group factors 
(V — 9V9q'). 
Exercise ITA5.3 
Take the explicit 2x2 representation for a vector given above, change the 
factors of 7 to satisfy the new reality conditions for SO(4) and SO(2,2), and 


show the determinant gives the right signatures for the metrics. 


A common example of index manipulation is to use antisymmetry whenever pos- 
sible to give vector products. For example, from the fact that V°? VYCy 4 18 antisym- 
metric in ay we have that 

aB . — lsay2 
vel, = 33V 
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where the normalization follows from tracing both sides. Similarly, 
aB P aB .— fey, 
V Wig tw Vig = OyV W 
It then follows that 
aB 5 _ (sa a8 5 ad ad 
V Wi3V" = (0SV -W-W Vig) =V-WV® - sV°W 


Antisymmetry in vector indices also implies some antisymmetry in spinor indices. 
For example, the antisymmetric Maxwell field strength F,, = —Fj,, after translating 
vector indices into spinor, can be separated into its parts symmetric and antisymmet- 
ric in undotted indices; antisymmetry in vector indices (now spinor index pairs) then 
implies the opposite symmetry in dotted indices: 


_ ‘dl _fn ig — 5 
F5,85 = —F 68,04 = 7 F (aay 48) + Faas) = Cssfoa + Cashys, fas = 3F a3," 


Thus, an antisymmetric tensor also can be written in terms of a (complex) 2x2 
matrix. (However, our normalization of tensor vs. symmetric spinor matrix will vary 


according to application.) 


We also need to define complex (hermitian) conjugates carefully because C' is 


imaginary, and uses indices consistent with transformation properties: 


vO = (Vi) = (yreyt = gor 00 


where we assume the spinor is fermionic (when re-ordering for hermitian conjugation), 


and we have used the spacetime coordinates as an example of a real vector (hermitian 


O99 66 999 


2x2 matrix). (Sometimes we will drop the on ~*, since it is redundant to the 
Note that, unlike SU(2), Ja 4 (W%)*.) In general, hermitian conjugation properties 
for any Lorentz representation are defined by the corresponding product of spinors: 


For example, 
(4p(%y))t = (4p) = pe) > fees eG iain 
More generally, we find 
(Tor) B1--Be) = (—1)90-D/2+he-1)/27(A1-Ba)(G1--.43) 
As we'll see later, most spinor algebra involves, besides spinors, just vectors and 


antisymmetric tensors, which carry only two spinor indices, so matrix algebra is often 


useful. When using bra-ket notation for 2-component spinors, it is often convenient 
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to distinguish undotted and dotted spinors. Furthermore, since spinor indices can 
be raised and lowered, we can always choose the bras to carry upper indices and the 
kets lower, consistent with our index-contraction conventions, to avoid extra signs 
and factors of C’. We therefore define (see subsection IB1) 


=| [vl = lal, 


Vall; f=()f%ol => F* = [1fa" lal 


where we now use “angle brackets” to denote the undotted spinor basis and “square 


brackets” to denote the dotted. As a result, we also have 


(bx) = (xb) =¥°xXa, [bx] =0 xa: (bx)! = [bx] 


(bIV [x] =¥°VaPxg, WIE) = ofa? x 
VW*+WV* =(V-W)I 


where we have used the anticommutativity of the spinor fields. From now on, we use 


this notation for the matrix representing a vector V (V,’), rather than the one with 
which we started (V°"). 


Exercise ITA5.4 
Consider the generators 


Gof = tA. T i) ‘a| 


and their Hermitian conjugates, where 003 =F dx°®, Show their algebra 
closes. What group do they generate? Find a subset of these generators that 


can be identified with (a representation of) the Lorentz group. 


Since we have exhausted all possible linear transformations on spinors (except for 
scale, which relates to conformal transformations), the only way to represent discrete 


Lorentz transformations is as antilinear ones: 
b= V2nPbs, (= —v 2nd *) 


From its index structure we see that n is a vector, representing the direction of the 


reflection. The product of two identical reflections is then, in matrix notation 
yy" _ 2n(np*)* = new =, n2 | 


where we have required closure on an SL(2,C) transformation (+1). Thus n is a unit 
vector, either spacelike or timelike. Applying the same transformation to a vector, 


where V transforms like w%y, we write in matrix notation 


V’ = -2nV*n = n?V —2(n-V)n 
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(The overall sign is ambiguous, and depends on whether it is a polar or axial vector.) 
This transformation thus describes parity (actually CP, because of the complex con- 
jugation). In particular, to describe purely CP without any additional rotation (i.e., 
exactly reflection of the 3 spatial axes), in our basis we must choose a unit vector in 


the time direction, 
VIn% = 6% => t= Fy, (HF =a) 
=> Vir = Vea 


which corresponds to the usual in vector notation, since in our basis 
aB _ na 
= O Ba 


To describe time reversal, we need a transformation that does not preserve the com- 


plex conjugation properties of spinors: For example, CPT is 
yi = wr, yp = ph = Vv! = _V 


(The overall sign on V is unambiguous.) 


In principle, whenever we work on a problem with both spinors and vectors we 
could use a mixed vector-spinor notation, converting between the usual basis for 
vectors and the spinor-index basis with identities such as 

O84 = Ou, 08,07) = 8458 
However, in practice it’s much simpler to use spinor indices exclusively, since then 
one needs no o-matrix identities at all, but only the trivial identities for the matrix 
C that follow from its antisymmetry. For example, converting the vector index on 
the o matrices themselves into spinor indices (a > a3), they become trivial: 
ap i= £2 B 
(or) a8, Oe 
(This is the same as saying an orthonormal basis of vectors has the components 


(V*), = 6f when the components are defined with respect to the same basis. ) 


Thus, the most general irreducible (finite-dimensional) representation of SL(2,C) 
(and thus SO(3,1)) has an arbitrary number of dotted and undotted indices, and is 


totally symmetric in each: Aa, Treating a vector index directly as a 


stim (Bin Bny” 
dotted-undotted pair of indices (e.g., a = ad, which is just a funny way of labeling 


a 4-valued index), we can translate into spinor notation the two constant tensors of 
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SO(3,1): Since the only constant tensor of SL(2,C) is the antisymmetric symbol, they 


can be expressed in terms of it: 


C26. 


ae abr ad, a8,94,55 = UCapCy6C x5C a, — CasCayC'y5Cs5) 


When we work with just vectors, these can be expressed in matrix language: 


V-W =tr(VW*) 


€abeaV °W?X°Y 4 = e(V,W, X,Y) =i tr(VW*XY* —Y*XW*V) 


(We have assumed real vectors; for complex vectors we should really write V-W* = ..., 
etc.) 


Exercise ITA5.5 
Prove this expression for the € tensor (in either index or matrix version) 
agrees with that defined in subsection IB3 (as modified in subsection IB5) 
by (1) showing total antisymmetry, (2) explicitly evaluating a nonvanishing 


component. 


6. Dirac 


The Dirac spinor we encountered in subsection IC1 is a 4-component reducible 


representation in D=4: in terms of two (“left” and “right”) two-component spinors, 


ee 
Wra 


The Hermitian metric Y that defines the (Lorentz-invariant) Dirac spinor inner prod- 
uct 
OY =WIY =r the, W=wir = (Ws o%) 


r= ( 0 oe) =v 


takes the simple form 


oe if 


The Dirac matrices are given by 


a pf Oy 0 Vv 
—Te=\ve, o J \-v* 0 


where the indices have been chosen to insure that the y matrices always take a Dirac 


spinor to the same type of spinor. Since {VY, W} = —V - W, the y matrices satisfy 


{7} ==9" 
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The extra sign is the result of normalizing the y’s to be pseudohermitian with respect 
to the metric: Yy'Y-! = +4. This Dirac spinor can be made irreducible by imposing 


a reality condition that relates w, and Wr: The resulting “Majorana spinor” is then 


We 
v=4(5) 


The product of all the y’s is a pseudoscalar, and an additional y-matrix: 
58 
av? ‘ta. if ~e Y. 
Y-1 = GW EabedY VY YY =4( 0 i) 
=> tat = 0, Lagat =-l 
(This is usually called “ys” in the literature for D = 4, or “yp” for D 4 4. We have 


renamed it for consistency with dimensional reduction.) It can be used to project a 


Dirac or Majorana spinor onto its two two-component spinors: 


Ha = 3(F + V2i9-1) = (59) (09) 


Various identities for these matrices can be derived directly from the anticommu- 


tation relations: For example, 


Wa = —2, VO = tt, dba =a-b, VAY a = Va 
ith, tr(a¥) = —2a-b, tr(db¢d) =a-bc-d+a-db-c—a-cb-d 


The trace identities follow from the fact that the only way to get a nonvanishing trace 
out of a product of y matrices is when there are terms proportional to the identity; 
since {y*, 7°} = —n”, this only happens when the indices are pairwise identical. The 
above results then follow from examination of relevant special cases. (Traces of odd 
numbers of y matrices vanish. An exception is y_;, until it is rewritten in terms of 


its definition as the product of the other y-matrices.) 


Although use of the anticommutation relations is convenient for generalization of 
such identities to arbitrary dimensions, 2-spinor bra-ket notation is easier for deriving 
4D identities. Since a Dirac spinor is the direct sum of a Weyl spinor and its complex 


conjugate, we write 
Y=(drot (Wr Y= Veal + VElal 


In this notation, there is no need to use a spinor metric Y, just as in Minkowski 
4-vector bra-ket notation there is no need for an explicit matrix to represent the 


Minkowski metric: It is included implicitly in the definition of the inner product 
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for the basis elements ((a|b) = Mav OF (als) = Cag). Thus hermitian conjugation is 
automatically pseudohermitian conjugation, etc.: W is W', from the effect of hermitian 
conjugating the basis vectors along with the components of the spinors they multiply. 


(See subsections IB4-5.) We then have simply 


Yog =—ladlgl—lphels H+ = (al, = T- = |"Jlal 


where we have replaced the vector index a > a on Yq. 


Exercise ITA6.1 


Use this representation for the y matrices and projection operators J/+ for all 


of the following: 
a Derive 


dey oe ‘dont Yep = flonti +++ gta 
dy ‘don Vag = —51 tr(gy +++ flan) — Y-1 tr(y-1i + don) 


b Rederive the trace identities above. (Hint: For the last identity, use the 
identity CjagC'y)5 = 0 repeatedly.) 
c Show that 
tr|(IT, — H_) Yaa 8 V4V55] = 2 ae, oB-74,88 
by comparison with the expression of the previous subsection for e. 
Exercise ITA6.2 
Again using this representation: 
a Show that 727) (up to a proportionality constant) generates the usual Lorentz 
transformations of SL(2,C) on the 2 2-component spinors in the Dirac spinor. 


b Relate this representation of the ~ matrices to the defining representation of 
Sp(4) as given in subsection IB5, noting that Sp(4) is the covering group of 
SO(3,2) (subsection IC5). 


7. Chirality /duality 


IT, are often called “chiral projectors”; 2-component spinors (not paired into 
Dirac spinors) are often called “chiral spinors”, and appear in “chiral theories”; the 
two 2-component spinors of a Dirac spinor are often labeled as having left and right 
“chirality”; etc. When these two halves decouple, a theory can have a “chiral sym- 


metry” 


Wy — ewe 
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Since chirality is closely related to parity (chiral spinors can represent CP, but need to 
be doubled to allow C, and thus P), Dirac spinors are often used to describe theories 
where parity is preserved, or softly broken, or to analyze parity violation specifically, 
using y_; to identify it. 

A similar feature appears in electrodynamics. We first translate the theory into 
spinor notation: The Maxwell field strength F{» is expressed in terms of the vector po- 
tential (“gauge field”) A,, with a “gauge invariance” in terms of a “gauge parameter” 
A with spacetime dependence. The gauge transformation 6A, = —0O, becomes 


t = 7. — e 
As = Ag 0 3X 


where 03 =O] dx. It leaves invariant the field strength Foy = OjaAy): 


fo gO 4A 


i=, if 
ary, 85 ay" 35 Ogg Acy = 40 3 + FF 


(as)38) + Faycsi)) 
= Cysfap t+ Cashes, fas = 7044p)" 
Maxwell’s equations are 
Os faa~ Tas 
They include both the field equations (the hermitian part) and the “Bianchi identi- 
ties” (the antihermitian part). 

Exercise IIA7.1 

We already saw VW* + WV* gave the dot product; show how VW* — WV* 
is related to the “cross product” VigWy). 

Exercise ITA7.2 

Write Maxwell’s equations, and the expression for the field strength in terms 
of the gauge vector, in 2x2 matrix notation, without using C’s. Combine 
them to derive the wave equation for A. 

Maxwell’s equations now can be easily generalized to include magnetic charge by 
allowing the current J to be complex. (However, the expression for F’ in terms of A is 
no longer valid.) This is because the “duality transformation” that switches electric 
and magnetic fields is much simpler in spinor notation: Using the expression given 


above for the 4D Levi-Civita tensor using spinor indices, 


d : 
rae = $€abea => fos = =t1 aA 


a 


More generally, Maxwell’s equations in free space (but not the expression for F’ in 


terms of A) are invariant under the continuous duality transformation 


iO 
fap = e Jag 
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(and J’ = en. 3 in the presence of both electric and magnetic charges). 


Exercise IIA7.3 
Prove the relation between duality in vector and spinor notation. Show that 


Fyy + is€aveaF"™ contains only fag and not fag: 


Exercise ITA7.4 


How does complexifying J, 3 modify Maxwell’s equations in vector notation? 


In even time dimensions, Wick rotation kills the 7 (or —7) in the spinor-index 
expression for €qg’,y’,<c/n@’- Since the (discrete and continuous) duality transformation 
now contains no 2, we can impose self-duality or anti-self-duality; i.e., that fag or fara 


vanishes, since they are now independent and real instead of complex conjugates. 


These continuous chirality and duality symmetries on the field strengths generalize 
to the free field equations for arbitrary massless fields in four dimensions. For reasons 
to be explained in the following section, they distinguish the two polarizations of the 
waves described by such fields. They are closely related to conformal invariance: In 
higher dimensions, where not all free, massless theories are conformal (even on the 


mass shell), these symmetries exist exactly for those that are conformal. 
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The general procedure for finding arbitrary representations of the Poincaré group 


relevant to physics is to: 


(1) Describe spin 0. As we have seen, this means starting with the coordinate rep- 
resentation, which is reducible, and apply the constraint p? + m? = 0 to get an 


irreducible one. 


(2) Find arbitrary, finite-dimensional, irreducible representations of the Lorentz 


group. This we have done in the previous section. 


(3) Take the direct product of these two representations of the Poincaré group, which 
give the orbital and spin parts of the generators. (The spin part of translations 
vanishes.) We then need a further constraint to pick out an irreducible unitary 


piece of this product, which is the subject of this section. 


1. Field equations 


We have already constrained the momentum: The equation 
p +m’ =0 


as an operator equation acting on a field or wave function is the “Klein-Gordon (or 
relativistic Schrodinger) equation”. States or fields that satisfy their field equations 
are called “on-(mass-)shell”, while those that don’t (or for which the equations haven’t 


been imposed) are “off-shell”. 


The next step is to constrain the “spin” (actually, its Lorentz generalization). 
The basic idea of the extra constraint is very simple: The Lorentz group introduces 
states of negative probability, since the Minkowski space metric is indefinite. For 
example, if we write the naive Lorentz invariant Hilbert-space norm for a vector wave 
function, the time component will have negative probability. (Similar remarks apply 
to spinors, e.g., for the metric Y ~ 7° for the Dirac spinor.) The solution to this 
problem, in first-quantized operator language, is to constrain the spin to eliminate 
the negative-metric component, in analogy to the way we have already constrained 


the momentum by the Klein-Gordon equation. We thus impose 


Sa? De + Wp = 0 


to kill the part of the Lorentz generators in the direction of the momentum, where “w 


is a constant to be determined. (Its term can be attributed to ordering ambiguities. ) 
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This equation is the general field equation for all spins (acting on the field strength), 


in addition to the Klein-Gordon equation (which is redundant except for spin 0). 


We will see that this constraint is appropriate for massless particles. Massive 
particles then follow from dimensional reduction: adding a further spatial dimension 


and fixing its component of momentum to a constant, the mass, so p? = p? + m?. 


Before examining this constraint, we first give an alternative “derivation” based 
on the conformal group. The earlier derivation of the massless particle from the con- 
formal particle for spin 0 can be generalized to all “spins”, i.e., all representations of 
the Poincaré group in arbitrary dimensions. There is a way to do this in terms of 
classical mechanics for all representations of the conformal group, by generalizing the 
description of the classical spinning particle. However, by analyzing the conformal 
particle quantum mechanically instead, applying a set of constraints, it will be clear 
how to generalize from conformal particles to general massless particles by weakening 
the constraints. The general idea is that the symmetry group for massive particles is 
the Poincaré group, while that for massless particles includes also scale transforma- 
tions, and finally conformal particles have also conformal boosts. So, starting with 
the conformal group and dropping anything to do with conformal boosts will give 


massless particles. 


We begin with a general representation of the conformal group SO(D,2) in terms 
of generators G4p, where A, B are D+2-component vector indices. We then impose 


constraints that are the conformally covariant form of p? = 0: Identifying 
(G**,G",G*-,G-*) = eeu wa AI) 


(where A = (+,a)) as the generators for translations, Lorentz transformations, di- 


latations, and conformal boosts, we see that 


Gee = LGlAGa*) _ po PG Gen =f 


is an irreducible piece of the product GG (symmetric and traceless) and includes: 


Gr, qr Ge. Cre. Go. G_~) = (P?, sd”, P,} oe s{A, P*, we) 


66 % 


where all have terms containing K°. 


Exercise ITB1.1 
Work out all the G’s in terms of P, J, A, and K. 


In general theories, even massless ones, it is not always possible to have invariance 


under conformal boosts. (We’ll see examples of this insubsection IXA7.) However, all 
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massless theories are scale invariant, at least at the free level. (In D=4, free massless 
theories can always be made conformal on shell. However, the fact that even these 
theories can have actions that are not invariant under conformal boosts proves that it 
is sufficient to add just dilatations to the Poincaré group. Furthermore, the fact that 
conformal boosts are not always an invariance in D>4 means that dropping them will 
give results in a dimension-independent form.) Therefore, only Gtt and Gt* can be 
defined in general massless theories, but we’ll see that these are sufficient to define 
the kinematics. The former is just the masslessness condition, which we used to pick 


the constraints in the first place. 


As we saw earlier, A just scales x*: We can therefore write the relevant generators 


as 
P=, JP aakals 5”, A= d{2",a}+w-1=2°+wt+ 


(We have used the antihermitian form of the generators.) The “scale weight” w+ 24 
is the real “spin” part of A, just as S,, is the spin part of the angular momentum 
Jap. To preserve the algebra it must commute with everything, and thus we can 
set it equal to a constant on an irreducible representation. We’ll see shortly that 
its value is actually determined by the spin Sg. It is the engineering dimension of 
the corresponding field. It has been normalized for later convenience; the value of 
w depends on the representation of S®, but is independent of D. The dilatation 
generator A is not exactly antihermitian because the integration measure d?x isn’t 
invariant under scaling. This is another reason w is determined, by the free action. 
The form we have given preserves reality of fields. The commutation relations for the 


spin parts, and the total generators, are the same as those for the orbital parts; e.g., 
[Sas, $4] = —4/5 S54 


(A convenient mnemonic for evaluating this commutator in general is to use $7 = 
a! instead.) 
Exercise IITB1.2 
We can also use this method to find the stronger conditions for the fully 
conformal case: 
a Find an expression for AK“ in terms of x, 0, S, and w that preserves the 
commutation relations. 
b Evaluate all the constraints G, and express the independent ones in terms of 


just O, S, and w (no 2). 
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Substituting the explicit representation of the generators into the constraint Gr“, 
and using the former constraint P? = 0 (when acting on wave functions on the right), 


we find that all x dependence drops out, leaving for Gt® the condition 
5, Pop + WOn = 0 


(paying careful attention to quantum mechanical ordering). 


Exercise IIB1.3 
Define spin for the conformal group by starting in D+2 dimensions: In terms 


of the (D+2)-dimensional coordinates y4 and their derivatives 04, 
Gap = y[AOp) + Sap 
Besides the previous conditions 
yi = 8 = {y4,043 =0 


impose the constraints, in analogy to the D-dimensional field equations, and 


taking into account the symmetry between y and 0, 


Sa? yp +wy,a = Wager + wd, = 0 


a Show that the algebra of constraints closes, if we include the additional con- 
straint 
554° Spo + w(wt 2)nap =0 
b Solve all the constraints with explicit y’s for everything with an upper “—” 
index, reducing the manifest symmetry to SO(D—1,1), in analogy to the way 


y” = 0 was solved to find y~. 


c Write all the conformal generators in terms of 7, Og, Sap, and w. 


2. Examples 


We now examine the constraints S,’0, + w0, = 0 in more detail. We begin by 


looking at some simple (but useful) examples. The simplest case is spin 0: 


Sab 0 > W 0 


The next simplest case (for arbitrary dimension) is the Dirac spinor (see subsections 
IC1 and ITA6): 


Sab = —43/Ya, Ye > Be Os + WOn = —ay’ Op + (w _ 5) Oa 
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= O=0, we 


where we have separated out the pieces of the constraint that are irreducible with 
respect to the Lorentz group (e.g., by multiplying on the left with y*). This gives 
the (massless) “Dirac equation” @W = 0. The next case is the vector: In terms of the 
basis |V) = V“|,), the spin is (see subsection IB5) 


Dab = |[a) Ol 


However, the vector yields just another description of the scalar: 
Exercise IIB2.1 
Apply the field equations for general field strengths to the case of a vector 
field strength. 
a Find the independent field equations (assuming the field strength is not just 
a constant) 
aly = 0, OF, =, w= 1 


Note that solving the first equation determines the vector in terms of a scalar, 
while the second then gives the Klein-Gordon equation for that scalar, and 
the third fixes the weight of the scalar to be the same as that found by starting 

with a scalar field strength. 
b Lorentz covariantly solve the second equation first to find a gauge field that 

is not a scalar. 

All other representations can be built up from the spinor and vector. As our final 
example, we consider the case where the field is a 2nd-rank antisymmetric tensor: 
Using the direct product representation (applied as in subsection IB2 given the vector 


representation) 
F=F™|,)@ |»);  Sab(lc) ® la)) = (Sable)) ® la) + le) ® Sabla) 
= (Sor) =o Be 


we find the equations 
(Sa°dp =p WOa) Fea = 50a Peay fos Na[cO” Fay a (w _ 1)0oF ca 


> alec] = Ors = 0, w= 1 


which are Maxwell’s equations, again separating out irreducible pieces (e.g., by tracing 


and antisymmetrizing). 
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Exercise ITB2.2 
Verify the representation of Lorentz spin given above for F,, by finding the 


commutation relations implied by this representation. 


Exercise ITB2.3 
Use the definition of the action of the Lorentz generators on a vector in vector 


and spinor notations, 
Sab = [[a) (yl, la) = la) ® la) 


Sap =|(a){al, Sag = lata 


to derive 


Exercise IITB2.4 
Consider the field equations in 4D spinor notation for a general field strength, 


totally symmetric in its m undotted indices and n dotted indices, 
Sa" Os = MO = eg = 10.95 = WwW = 3(m a n) 


a Show this implies 


b Translate the field equations into vector notation (in terms of S,,), finding 


S,°0, + wO, = 0 and an axial vector equation. 


c Show that the two equations are equivalent by deriving the equations of part 
a from S,°0; + w0, = 0 alone, and from the axial equation alone (except that 


the axial equation doesn’t work for the cases m = 7). 


In each case, choosing the wrong scale weight w would imply the field was con- 
stant. Note that we chose the field strength F.», to describe electromagnetism: The 
arguments we used to derive field equations were based on physical degrees of free- 
dom, and did not take gauge invariance into account. In chapter XII we use more 
powerful methods to find the gauge covariant field equations for the gauge fields, and 


their actions. 
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3. Solution 


Free field equations can be solved easily in momentum space. Then the simplest 
way to do the algebra is in the “lightcone frame”. This is a reference frame, obtained 


by a Lorentz transformation, where a massless momentum takes the simple form 
p° = ofp 


(using only rotations), or the even simpler form p* = +64 (using also a Lorentz 
boost), where again + is the sign of the energy. In that frame the general field 


equation 5,°0, + wd, = 0 reduces to 
S*=0, w= St 


The constraint S~* = 0 determines S*~ to take its maximum possible value within 
that irreducible representation, since S~* is the raising operators for St~: For any 


eigenstate of ST, 


S*-|h) =hlh) => S*-(S~Ih)) =(S-igt 4 [S*-, Sn) = (A+ (54a) 


The remaining constraint then determines w: It is the maximum value of S'*~ for 


that representation. By parity (+ < —), —w is the minimum, so 
w>0; w=0 6 Sv®=0 


since if S*+~ = 0 for all states then S® = 0 by Lorentz transformation. As we have 
seen by other methods (but can easily be derived by this method), w = 5 for the 
Dirac spinor and w = 1 for the vector; since general representations can be built from 
reducing direct products of these, we see that w is an integer for bosons and half- 
integer for fermions. If we describe a general irreducible representation by a Young 
tableau for SO(D—1,1) (with tracelessness imposed), or a Young tableau times a 
spinor (with also y-tracelessness y“Wa...5 = 0), then it is easy to see from the results 
for the spinor and vector, and antisymmetry in rows, that w is simply the number 
of columns of the tableau (its “width”), counting a spinor index as half a column: 


% 


S*~ just counts the maximum number of “—” indices that can be stuck in the boxes 
describing the basis elements. (In fact, Dirac spinor ® Dirac spinor gives just all 


possible 1-column representations. ) 


This leaves undetermined only S* and S**. However, S** (“creation operator” ) 
is canonically conjugate to S~* (“annihilation operator”), so its action has also been 
fixed: 

[Sto] js st 458 
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(S% vanishes for i = 7, so St’ and S~ are conjugate, though not “orthonormal”. The 
constant S'*~ was fixed above to be nonvanishing, except for the trivial case of spin 
0.) Equivalently, S“ preserves S~’ = 0, while S** doesn’t: S” are the only nontrivial 


spin operators acting within the subspace satisfying the constraint. 


Thus only the “little group” SO(D—2) spin 8” remains nontrivial: The original 
irreducible representation of SO(D—1,1) Lorentz spin S® was a reducible representa- 
tion of SO(D—2) spin $”; the irreducible SO(D—2) representation with the highest 
value of St~ is picked out of this SO(D—1,1) representation. This solution also gives 
the field strength in terms of the gauge field: Working with just the highest-S*7- 
weight states is equivalent to working with the gauge field, up to factors of OT. 

As an explicit example, for spin 1/2 we have simply 7 W = 0, which kills half the 
components, leaving the half given by ytW. For spin 1, we find 

Pi2=0 =< Fea 
peFrel—0 = only Ft*740 
In the “lightcone gauge” At = 0, we have Ft’ = 0* A’, so the highest-weight part of 


F is the transverse part of the gauge field. The general pattern, in terms of field 


\ => only Ft'40 


strengths, is then to keep only pieces with as many as possible upper + indices and 
no upper — indices (and thus highest St~ weight). In terms of the vector potential, 
we have 

Fe ~plt4l = only A’ #0 
The general rule for the gauge field is to drop + indices, so the field becomes an 
irreducible representation of SO(D—2). All + indices on the field strength are picked 
up by the momenta, which also account for the scale weight of the field strength: All 


gauge fields have w = 0 for bosons and w = $ for fermions. 


Exercise ITB3.1 


Using only the anticommutation relations {y*, y’} = —n%, construct projec- 


tion operators from y~: These are operators I/; that satisfy 


Reis ier: (no =]. a =1 


Because of time reversal symmetry yt <@ —y~ (or parity yt <— y~), these 
project onto two subspaces equal in size. 

A method equivalent to using the lightcone frame is to perform a unitary trans- 

formation U on the spin that is the inverse of the transformation on the coordi- 

nates/momentum that would take us to the lightcone frame: We want a Lorentz 


transformation A,” on the field equations, which are of the form 


On" Ds = NP OF = on” as wee 
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that has the effect 
UO,{U*=Af0F",, Lp? pain = 
0=U0,°p,U-! = A,°O2A° py, = APOen, => O.°p;, = 


If |w) satisfies the original constraint, then U|y) will satisfy the new one. If we like, 
we can always transform back at the end. This is equivalent to a gauge transformation 
in the field theory. 


It is easy to check that the appropriate operator is 
et ee 
Any operator V® that transforms as a vector under S@, 
[se V4 = VleyPle 


but commutes with p, is transformed by U into UVU~! = V’ as 


(p')? 
2(pt)? 


as follows from explicit Taylor expansion, which terminates because S** act as low- 


vos. Vi=vigvte, Vo =v-4Vib svt 
D D 


ering operators (as for conformal boosts in subsection IA6). This yields the desired 
result . yt, 
Vv" = Vv" — 
when we impose the field equation p? = 0. 
Exercise IIB3.2 
Check this result by performing the transformation explicitly on the con- 
straint. Before the transformation, the lightcone decomposition of the con- 
straint is 
(—S*~ + w)pt + Sty’ = 0 
—S'~pt + S%p) + wp' — S'tp” =0 
Sp + (-S"* +w)p =0 


Show that after this transformation, the constraint becomes 


(—S*~ + w)p* =0 
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S-tpi + (—St- +w) ~_ gt PB! gti ad 
Clearly these imply 
=o, S*=0 
with p? = 0. 

On the other hand, if instead of using the lightcone identification of x* as “time”, 
we choose to use the usual x° for purposes of finding the evolution of the system, then 
we want to consider transformations that do not involve p°, instead of not involving 
the “energy” p-. Thus, by p°-independent rotations alone, the best we can do is to 
choose 

p = 0, p =W 
i.e., we can fix the value of the spatial momentum, but not in a way that relates to 
the sign of the energy. The result is then 


p >0t p=iip 
yp <0: pt=d*p 
The result is similar to before, but now the positive and negative energy solutions are 
separated: In this frame the field equations reduce to 
p>0: St*=0, St =w 
p <0: Stt=0, St =-w 


Thus, while w takes the same value as before, now the positive-energy states are 
associated with the highest weight of St~, while the negative-energy ones go with 
the lowest weight (and nothing between). The unitary transformation that achieves 
this result is a spin rotation that rotates S@ in the field equations with the same 
effect as an orbital transformation that would rotate (p', p') > (w,0). By looking at 
the special case D = 3 (where there is only one rotation generator), we easily find 
the explicit transformation 


U = exp tan (4) ie a | 
p Ip"| 
Exercise ITB3.3 


Perform this transformation: 


a Find the action of the above transformation on an arbitrary vector V*. (Hint: 
Look at D = 3 to get the transformation on the “longitudinal” part of the 
vector.) In particular, show that 


V"pa=V"p,, p= Oop? + dtw 
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b Show the field equations are transformed as 
Sp, 4 wp" + «ye. wp = p(w _ eas _ Lg 
Sp, i. wp! iy 1p} (WS™ _ ps”) + p(w -_ oh 
Spa twp) => —[5 — TA p'p!|(wS™ — pS”) + p'(w - £8") 


Note that the first equation gives the time-dependent Schrodinger equation, 


with Hamiltonian 
Pies Legis _ Spi) a tems 


This diagonalizes the Hamiltonian H (in a representation where S"° is diag- 


onal). Thus the only independent equations are 
p=, 5° = e(p)w, St — e(p®)S% = 0 


leading to the advertised result. 


c Find the transformation that rotates to the p’ direction instead of the 1 di- 
rection, so 


H- ele 
P| 


4. Mass 


So far we have considered only massless theories. We now introduce masses 
by “dimensional reduction”, identifying mass with the component of momentum in 
an extra dimension. As with the extra dimensions used for describing conformal 
symmetry, this extra dimension is just a mathematical construct used to give a simple 
derivation. (Theories have been postulated with extra, unseen dimensions that are 
hidden by “compactification”: Space curls up in those directions to a size too small 
to detect with present experiments. However, no compelling reason has been given 


for why the extra dimensions should want to compactify.) 
The method is to: 


(1) extend the range of vector indices by one additional spatial direction, which we 


call “—1”; 


(2) set the corresponding component of momentum to equal the mass, 


p-1=™ 
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and 


(3) introduce extra factors of i to restore reality, since 0_, = ip_; = im, by a unitary 


transformation. 


Since all representations can be constructed by direct products of the vector and 
spinor, it’s sufficient to define this last step on them. For the scalar this method is 
trivial, since then simply p? — p?+m?. Except for the last step, the other constraint 


becomes 


gees Sa,14m + wd, =0, S_1°0, + wim =0 


For the spinor, since any transformation on the spinor index can be written in 
terms of the gamma matrices, and the transformation must affect only the —1 direc- 
tion, we can use only y_;. (For even dimensions, we can identify the y_; of dimen- 
sional reduction with the one coming from the product of all the other y’s, since in 


odd dimensions the product of all the y’s is proportional to the identity.) We find 
U = exp(—my-1/2V2) : 13-1, Ya 7 —V 29-17 


We perform this transformation directly on the spin operators appearing in the con- 
straints, or the inverse transformation on the states. Dimensional reduction, followed 


by this transformation, then modifies the massless equation of motion as 
if = 19 — my > —V27-1(i9 + 3B) 
so igV =0 > (19 + )¥ =0. 
The prescription for the vector is 
U=exp(sim) 2 Pd), Ca (=) 


with the other basis states unchanged. This has the effect of giving each field a —2 


for each (—1)-index. For example, for Maxwell’s equations 


al bc] Ora Fhe (redundant) 
Oral be] — : —? la 
Oil 54 7 ml ap —1(OjaFy-1 + mF op) 
b Ors + mF a4 OF» + mF 
OF» = 
OF 4a —10°F 14 (redundant) 


Note that only the mass-independent equations are redundant. Also, F,-; appears 
explicitly as the potential for F,,, but without gauge invariance. Alternatively, we 
can keep the gauge potential: 
Fay = Oe Ay { Fan = OaApy 

—_ 


Pap = OA 
b [ato] te — O,A_1 — imA, iF 4= —i(O,A_1 ai mA,) 
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This is known as the “Stiickelberg formalism” for a massive vector, which maintains 
gauge invariance by having a scalar A_, in addition to the vector: The gauge trans- 


formations are now 


5A 3) dAg = —Onr dA, = —O,A 
i. . §A_1 = —imA —iSA_, = —imd 
Exercise IIB4.1 
Consider the general massive field equations that follow from the general 


massless ones by dimensional reduction. One of these is 
S_1°0, + wim = 0 


(before restoring reality). This scalar equation alone gives the complete field 


equations for w=1/2 and 1 (antisymmetric tensors), 0 being trivial. 
a Show that for w=1/2 it gives the (massive) Dirac equation. 


b Expanding the state over explicit fields, find the covariant field equations it 
implies for w=1. Show these are sufficient to describe spins 0 (vector field 
strength: see exercise IIB2.1) and 1 (F,, and F,_,). Note that S~!* act as 
generalized y matrices (the Dirac matrices for spin 1/2, the “Duffin-Kemmer 
matrices” for w=1), where 


gab = —(s-™, ial 


c Show that these covariant field equations imply the Klein-Gordon equation 
for arbitrary antisymmetric tensors. Show that in D=4 all antisymmetric 
tensors (coming from 0-5 indices in D=5) are equivalent to either spin 0 or 


spin 1, or trivial. (Hint: Use €gpcq-) 


d Consider the reducible representation coming from the direct product of two 


Dirac spinors, and represent the wave function itself as a matrix: 
SWW = Sip + ws 


where i = (—1,a) and S“ is the usual Dirac-spinor representation. Using the 
fact that any 4x4 (in D=4) matrix can be written as a linear combination of 
products of y-matrices (antisymmetric products, since symmetrization yields 
anticommutators), find the irreducible representations of SO(4,1) in W, and 


relate to part c. 
Exercise IITB4.2 


Solve the field equations for massive spins 1/2 and 1 in momentum space by 
going to the rest frame. 
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The solution to the general massive field equations can also be found by going to 
the rest frame (p? = m): The combination of that and dimensional reduction is, in 


terms of the massive analog of lightcone components, 
pi=+(P+p")=V2m, p= (p’-p*)=0,  p' =0 


where p’ are now the other D—1 (spatial) components. This fixing of the momentum 
is the same as the lightcone frame except that p' has been replaced by p~', and 
thus p’ now has D—1 components instead of D—2. The solution to the constraints is 
thus also the same, except that we are left with an irreducible representation of the 
“little group” SO(D—1) as found in the rest frame for the massive particle, vs. one 


of SO(D—2) found in the lightcone frame for the massless case. 


5. Foldy-Wouthuysen 


The other frame we used for the massless analysis, which involved only energy- 
independent rotations, can also be applied to the massive case by dimensional reduc- 
tion. The result is known as the “Foldy-Wouthuysen transformation” , and is useful for 
analyzing interacting massive field equations in the nonrelativistic limit. Replacing 


p' + p-| =m in our previous result, we have for the free case 


LU = exp fan (=) sue] F UHU 1 =18-%w 
m IP . 
For purposes of generalization to interactions, it was important that in the free trans- 
formation (1) we used only the spin part of a rotation, since the orbital part could 
introduce explicit 2 dependence, and (2) we used only rotations, since a Lorentz 
boost would introduce p® dependence in the “parameters” of the transformation, 


which could generate additional p° (time derivative) terms in the field equation. 


Exercise ITB5.1 
Perform this transformation for the Dirac spinor, and then apply the reality- 


restoring transformation to obtain 
H- V2y0w 


We then can use the diagonal representation yo = e he /V2. (We can de- 
fine this representation, up to phases, by switching yo and y_; of the usual 
representation.) In general the reality-restoring transformation will be unnec- 
essary for any spin, since applying the field equation S~!° = +w picks out a 


representation of the “little group” SO(D-—1). 
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In the interacting case the result generally can’t be obtained in closed form, so it is 
derived perturbatively in 1/m. The goal is again a Hamiltonian diagonal with respect 
to S~'°, to preserve the separation of positive and negative energies; we then can set 
S~!° — w to describe just positive energies. We thus choose the transformation to 
cancel any terms in H that are off-diagonal, which come from odd total numbers of 
“1” and “0” indices from the spin factors in any term: i.e., odd numbers of S® 
and S~" (e.g., the Sp; term in the original H). For example, for coupling to an 
electromagnetic field, the exponent of U is generalized by covariantizing derivatives 
(minimal coupling 0 — V = 0+7A), but also requires field-strength (£ and B) terms 
to cancel certain ones of those generated from commutators of these derivatives in 


the transformation: 
Viens eee Ye = [Vv?,. Vv) S08” 


Before performing this transformation explicitly for the first few orders, we con- 
sider some general properties that will allow us to collect similar terms in advance. 
(Few duplicate terms would appear to the order we consider, but they breed like 
rabbits at higher orders.) We start with a field equation F that can be separated into 


“even” terms € and “odd” ones O, each of which can be expanded in powers of 1/m: 


Foe+o: £25 we, 0=S m0, 
n=0 


n=—-1 


Note that the leading (mt!) term is even; thus we choose only odd generators to 


transform away the odd terms in ¥, perturbatively from this leading term: 
Poehe, Ga) iG. 


Since F’ is even while G is odd, we can separate this equation into its even and odd 


parts as 
F' = cosh(Le)E + sinh(Lo)O 
0 = sinh(Le)E + cosh(Lg)O 
(with Le = |G, | as in subsection IA3). Since we can perturbatively invert any 


Taylor-expandable function of 2g that begins with 1, we can use the second equation 


to give a recursion relation for G,,: Separating the leading term of Ff, 


E=mE{+AE,  —mI/G,E4] =[G, AE] + Le coth(Le)O 
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which we can expand in 1/m [after Taylor expanding £L¢ coth(Lq@)] to give an expres- 
sion for [G,,,€_;| to solve for G,,. We can also use the implicit solution for [G, €] 


directly to simplify the expression for F’: 
F' =E + tanh($Ll_)O 
For example, to order 1/m? we have for F’ 
Fe Fy op; Fi = &, + $[Gi, Oo] 


Fy = Ex + 4[G2, Oo] + 4[G1, O1] 
To this order we therefore need to solve 


—[Gi, E-1] = Oo, —[Go, €_1| = O1 + [Gi, Ep] 


For our applications we will always have 
EE, =2—25 " 


unchanged by interactions. We have oversimplified things a bit in the above deriva- 
tion: For general spin we need to consider more than just even and odd terms; we 


need to consider all eigenvalues of S~!°: 
[s-* F.] = sF 


and find the transformation that makes F’ commute with it (s = 0). The procedure 
is to first divide into even and odd values of s, as above, then to divide the remaining 
even terms in F’ into twice even values of s (multiples of 4) as the new €’ and twice 
odd as the new ©’, which are transformed away with the new twice odd G’, and so 
on. This very rapidly removes the lower nonzero values of |s| (1 — 2 — 4 — ...), 
which has a maximum value of 2w (from the operators that mix the maximum value 
S~ = w with the minimum $~'° = —w). For example, for the case of most interest, 
the Dirac spinor, the only eigenvalues (for operators) are 0 and +1, so the original 
even part does commute with S~'°, and the procedure need be applied only once. 
Furthermore, terms in F of eigenvalue s can be generated only at order m!~* or 
higher; so at any given order the procedure rapidly removes all undesired terms for 
any spin. 

Since the terms we want to cancel are exactly the ones with nonvanishing eigen- 
values of S~!°, they can always be written as [G, S~!°] for some G, so we can always 


find a transformation to eliminate them: 


[S?, Gn] =sGen => Gen =—“{[G, AE] + Le coth(La)O} sn 
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(This is just diagonalization of a Hermitian matrix in operator language.) In partic- 
ular for the Dirac spinor, since €_; has only +1 eigenvalues, it’s easy to see that not 
only do all even operators commute with it, but all odd operators anticommute with 
it. (Consider the diagonal representation of €_;: {(} °,),(° *)} = 0.) We then have 
simply 


w= $ > tear =] > [E_1, AE| = {E_1, O} = {E_1, G} = 0 
=> mG=—4{[G, AE] + Lecoth(Le) OE 


As a final step, we can apply the usual transformation 


— imtS719/w 
Uo =e€E 


which commutes with all but the p® term in &€ to have the sole effect of canceling 
€_,, eliminating the rest-mass term from the nonrelativistic-style expression for the 


energy. 


For the minimal electromagnetic coupling described above, we have besides €_; 
Eo = n°, Oo = 1 Sn 


where we have written 7* = p* + A® (instead of 7* = —iV*, to save some i’s). There 
are no additional terms in F for minimal coupling for spin 1/2, but later we’ll need to 
include nonminimal effective couplings coming from quantum (field theoretic) effects. 
There are also extra terms for spins 0 and 1 because the field strength is not the same 
as the fundamental field, so we'll treat only spin 1/2 here, but we’ll continue to use 
the general notation to illustrate the procedure. Using the above results, we find to 
order 1/m? for F’ 
CS5 Go = wS*iF™ 


in agreement with with the free case up to field strength terms. The diagonalized 


Schrodinger equation is then to this order, including the effect of Uo, 
Fl=0, Foam, Fos —gyla{S-, SUR! + SO(n)" 
Fy = —qU{SY, SUP) — SULLEM, 17}) 


For spin 1/2 we are done, but for other spins we would need a further transformation 
(before Up) to pick out the part of Fi that commutes with S~!° (by eliminating the 
twice odd part); the final result is 


Fy = HR({S, SY} — (8%, SH} GF) + SHEE™, WY} 
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It can also be convenient to translate into + notation (as for the massless case, but 
with index 1 — —1): We then write 


Fiy=0, Fos, Fra —gyla{S", S7FY + 84 (n'y) 


Fy = —Zh{S*6, SO }(GFM) — SHGF% aw} 


1 
4 
In this notation the eigenvalue of S*~ = S~!'° for any combination of spin operators 


can be simply read off as the number of — indices minus the number of +. 


Exercise IIB5.2 
Find the Hamiltonian for spin 1/2 in background electromagnetism, expanded 
nonrelativistically to this order, by substituting the appropriate expressions 
for the spin operators in terms of y matrices, and applying St~ = +w on 
the right for positive/negative energy. (Ignore the reality-restoring transfor- 
mation.) y-matrix algebra can be performed directly with the spin operators: 
For the Dirac spinor we have the identities 


by sa” — 30( 0) ~ Nac =e ie, o 4} = al — 284 S+~ 


avc 


6. Twistors 


Besides describing spin 1/2, spinors provide a convenient way to solve the condi- 
tion p? = 0 covariantly: Any hermitian matrix with vanishing determinant must have 
a zero eigenvalue (consider the diagonalized matrix), and so such a 2x2 matrix can 
be simply expressed in terms of its other eigenvector. Absorbing all but the sign of 


the nontrivial eigenvalue into the normalization of the eigenvector, we have 
rS0 > gress 


for some spinor p® (where p* = (p%)*). Since p® is the (canonical) energy, the + 
is the sign of the energy. This explains why time reversal (actually CT in the usual 
terminology) is not a linear transformation. Note that p® is a commuting object, while 
most spinors are fermionic, and thus anticommuting (at least in quantum theory). 


Such commuting spinors are called “twistors”. 


Exercise IIB6.1 
Show that, in terms of its energy FE and the angular direction (6,¢) (with 
respect to the “1” axis) of its velocity, a massless particle is described by the 


twistor 
a _ 91/4 6 16/2 or 0,—ib/2 
p* =2 VJ |E|(cosse ? singe /2) 
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One useful way to think of twistors is in terms of the lightcone frame. In spinor 
notation, the momentum is 

p= +(99) 

If we write an arbitrary massless momentum as a Lorentz transformation from this 

lightcone frame, then the twistor is just the part of the SL(2,C) matrix that con- 
tributes: 
pe =" 9,o" = 0,0. oy oy = te Gs =a" 

For this reason, the twistor formalism can be understood as a Lorentz covariant form 


of the lightcone formalism. 


The twistor construction thus gives a covariant way of constructing wave functions 
satisfying the mass-shell condition (Klein-Gordon equation) for the massless case, 
Ow = 0, where O = 0? = —p?. We simply Fourier transform, and use the twistor 
expression for the momentum, writing the momentum-space wave function in terms 


of twistor variables (“Penrose transform” ): 


Y(a) = / d* pod palexp(ix*’ PoP 3) X+ (Par Da) + exp(—it’ pads)x—(Pas Pa) 


where y+ describe the positive- and negative-energy states, respectively. (The integral 


over pz can be performed also, effectively taking the Fourier transform with respect 


to that variable only, treating 47° p, as the conjugate.) 


We can extend the matrix notation of subsection ITA5-6 to twistors: 


Gra, ‘p= \pe l=" lal, 


P = |p)(pl, — P* = |p|{p| 


As a result, we also have for twistors 


(pq) =— (ap), —|pa] = —lapl; 


(pq) (rs) + (ar)(ps) + (rp) (qs) = 0 


These properties do not apply to physical, anticommuting spinors, where (wy) = 
+(xw), and (pp) #0. 


Another natural way to understand twistors is through the conformal group. We 
have already seen that the conformal group in D dimensions is SO(D,2). Since this 
group in four dimensions is the same as SU(2,2), it’s simpler to describe its general 
representations (and in particular spinors) in SU(2,2) spinor notation. Then the 
simplest way to generate representations of this group is to use spinor coordinates: 


We therefore write the generators as (see subsection IC1) 


Ga? = CPCa — 404C6°Ce 
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where we have subtracted out the trace piece to reduce U(2,2) to SU(2,2) and, consis- 
tently with the group transformation properties under complex conjugation, we have 
chosen the complex conjugate of the spinor to also be the canonical conjugate: The 
Poisson bracket is defined by 
[Cas ¢"] = O% 

To compare with four-dimensional notation, we reduce this four-component spinor by 
recognizing it as a particular use of the Dirac spinor. Using the same representation 
as in subsection IIA6, we write 


An (a), Ces lute T= ( 
Now the Poisson brackets are 


wap] = 58, (aa, p*] = 58 


a 


0 a 
cw 9 


The group generators themselves reduce to 
PoPgr YoWg, Plas); D(aW§y),  P°Wa + P"Wa — 2 
(for E > 0, with an overall — for E < 0), which are translations, conformal boosts, 


SL(2,C) generators and their complex conjugates, and dilatations. 


Another kind of twistor, related to position space instead of momentum space, 
follows from this (D+2)-coordinate description of conformal symmetry for D=4 (see 
subsection IA6). In practice, it’s more convenient to work with invariances than con- 
straints. In this case, we can solve the lightcone constraint on Wick-rotated D=3+3 
or 5+1 space, replacing 6-component conformal vector indices with 4-component con- 
formal spinor indices, with a position-space twistor: 


Aa ,B 


ABCD _Q 5 AB _ AaB 


y? = <€aBeDy i] 
where A is an SL(4) (or SU*(4)) index and a is an SL(2) (or SU(2)) index, and 24° 
is real (with either two real or two pseudoreal indices). (Here the SL groups apply 
to 3+3 dimensions, the SU groups to 5+1.) Whereas y had 6 — 1 = 5 components 
due to the constraint, z has 4-2 — 3 = 5 components due to the SL(2) (SU(2)) gauge 
invariance of the above relation to y. These coordinates reduce to the usual by an 
SL(2) transformation: 

A = (u, 4), “= de (Os yr) =  SL(2) gauge ro” = rg 
where e = ?. 
Exercise IIB6.2 
Substitute this spinor-notation z(\,x) into y ~ z? and compare with the 


vector-notation y(e, x) of subsection [A6. 
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7. Helicity 
A sometimes-useful way to treat the transverse spin operators S$” is in terms of 
Ware = 5 Pave i 5 Pia Ste} 


which (like the field equations) can be written in terms of just the Poincaré generators. 
This is the part of S,, whose commutator with the field equations is proportional to 
the field equations (i.e., it preserves the constraints). For the massive case, it reduces 
to 8” for SO(D-—1) in the rest frame; for the massless case in the lightcone frame, 
using the field equations it again reduces to S”, but now for just SO(D—2). In D=4 


this is the “Pauli-Lubaniski (axial) vector” 


1 bcd 
WwW, ==-W €bcda 


6 
We can choose our states to be eigenstates of a component of it: For example, for 
massless states W°/P® is called the “helicity”. For massive states the helicity is 
defined as W°/ |P |, but is less useful, especially since it is undefined (0/0) in the rest 
frame. In that case one instead chooses a component in terms of a (momentum- 
dependent axial) vector s* as s*W,, where s*P, = 0 and s? = 1/m?. 
Exercise IIB7.1 
Show in both the massless and massive cases that W,,. reduces to the little 
group generators on shell by going to the appropriate reference frame. 
The twistor representation of the conformal group does not give the most general 
representation, but it does give all the (free) massless ones. The reason it gives 


massless ones is that this representation satisfies the constraint (see subsection ITB1) 
GiB] leD] _ Gu Gp)” — traces = 0 


which includes p? = 0 as well as all the equations that follow from p? = 0 by conformal 


transformations. As a consequence, this representation also satisfies 
GAlGe® — trace = hGP 


where h is the helicity. This equation may be more recognizable in SO(4,2) notation, 
as 


ge PCPEPGODGerr = ihnG4® 


This equation includes, as its lowest mass-dimension part (as defined by dilatations), 


the Pauli-Lubanski vector 


We Sse Pg th” 
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(The “i” appears in the last two equations only when we use the antihermitian form of 
the generators G4p and J,».) Although any massless representation of the conformal 
group satisfies the above conditions (see exercise I1B2.3), the twistor representation 
satisfies the unusual property that helicity is realized as a linear transformation on the 
coordinates: For the twistors the implicit definition of helicity can be solved explicitly 
to give 
h= HOA, Cah = 304 + 1 = 3 (p%wa — pa) 

(also for EF > 0), which is exactly the U(1) transformation of U(2,2)=SU(2,2)@U(1). 
(This is similar to SU(2) in terms of “twistors”: See exercise IC1.1.) On functions of 


Do and pa, it effectively just counts half the number of p,’s minus pg’s. 


Exercise IIB7.2 
These results are pretty clear from symmetry, but we should do some algebra 
to check coefficients: Express Ja, and P, in terms of the twistors pa, Pa, Wa, Wa 
(see also exercise IIB2.3 for normalization), and plug into ePJ = ihP to derive 


the above expression of h in terms of twistors. 


The simple form of the helicity in the twistor formalism is another consequence 
of it being a covariantized lightcone formalism. In the lightcone frame, there is still 
a residual Lorentz invariance; in particular, a rotation about the spatial direction in 
which the momentum points leaves the momentum invariant. This is another defini- 
tion of the helicity, as the part of the angular momentum performing that rotation. 
(Only spin contributes, since by definition the momentum is not rotated.) Since the 
product of two Lorentz transformations is another one, this rotation can be inter- 
preted as a transformation acting on the Lorentz transformation to the lightcone 
frame, i.e., on the twistor, such that the momentum is invariant. This is simply a 


phase transformation: 
ef 0 Y ; 
/ / 8 
os s) gy => pl =e p® 
a 
We can generalize the Penrose transform in a simple way to wave functions car- 


rying indices to describe spin: 


VY ictus Bi ean (x) = | dyad’ Da Pa, ** ‘PomP3, _ "PZ 


x [exp(ix™’pad3)X+ (Pas Pa) + exp(—ix” paps) X- (Da, Pa)] 


For the integral to give a nonvanishing result, the integrand must be invariant under 


the U(1) transformation generated by the helicity operator h: In other words, y+ must 


have a transformation under h, i.e., a certain helicity, that is exactly the opposite that 
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of the explicit p factors that carry the external indices to give a contribution to the 
integral, since otherwise integrating over the phase of pg would average it to zero. 
(Explicitly, if we derive the helicity by acting on the Penrose transform, this minus 
sign comes from integration by parts.) This means that (a2) automatically has a 


certain helicity, half the number of dotted minus undotted indices: 
h=Hn—m) — [w=3m-+n) 


(also for F > 0), as given by the above twistor operator expression acting on y+. 


(Alternatively, comparing the z-space form of the Pauli-Lubansky vector, its action 
plus that of the twistor-space one must vanish on |*)p,, so the helicity is again minus 


the twistor-space helicity operator acting on the prefactor.) 


If we work in momentum space, then we use implicitly the relation between mo- 


mentum and twistors. Then we can use the abbreviated form of the above relation, 


Dos. ctmBr.fn P) = Par ***PamBg, ***P3,X(Pa, Pa) 


using x, or y_ as appropriate to the sign of energy. 

The above transform is just for field strengths: The generalization to on-shell 
gauge fields is straightforward, though not as simple, since gauge fields contain more 
than just 2 physical helicities, but also unphysical degrees of freedom. For example, 


for the 4-vector potential of electromagnetism, we have 
A.s(2) = f pad’ps {Pslexplic®* paps) As (Po, Pa) 
+ exp(—ix**paps)A_4(Pa, Pa)| + h.c.} 


Exercise IIB7.3 
Look at the Maxwell field strength in spinor notation fag (and its complex 
conjugate) defined in subsection ITA7, in terms of the above gauge field. Show 


it reduces to a special case of the previous general expression, and express + 


in terms of Ay, and Axs. 


Since, after restricting to the appropriate helicity, the integral over this phase is 
trivial, we can also eliminate it by replacing the “volume” integral over the twistor 


or its complex conjugate (but not both) with a “surface” (boundary) integral: 


/ d’p. ¢ pdpo 


(Alternatively, we can insert a 6-function in the helicity.) The result is equivalent to 


the usual integral over the three independent components of the momentum. 
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This generalization of the Penrose transform implies that w(x) satisfies some 


equations of motion besides p? = 0, namely 


Pew, Bs.b = PW. 6.3 = 


which are also implied by S,°0, + wd, = 0 (see exercise IIB2.3). Besides Poincaré 


invariance, these equations are invariant under the phase transformation 


/ __ 22h0 ‘ 
Yap a Oe ee 


that generalizes duality and chiral transformations. We also see that (anti-)self- 
duality and chirality are related to helicity. Another way to understand the twistor 
result is to remember its interpretation as a Lorentz transformation from the light 
cone: In the light cone frame, where ptt is the only nonvanishing component of p**, 
the above equations of motion imply the only nonvanishing component of ypote-mB Bn 
is ptt which can be identified with y, (for ptt > 0) or y_ (for pt? <0). 
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Supersymmetry is a symmetry that relates fermions to bosons. It includes the 


Poincaré group as a subgroup. We’ll see later that quantum field theory requires par- 


ticles with integer spin to be bosons, and those with half-integer spin to be fermions. 


This means that any symmetry that relates bosonic wave functions/fields to fermionic 


ones must be generated by operators with half-integer spin. The simplest (but also 


the 


sect 


most general, at least of those that preserve the vacuum) is spin 1/2. In this 


ion we look at representations, generalizing the results of the previous sections 


for Poincaré symmetry. 


Although supersymmetry has not been experimentally verified yet, it is a major 


ingredient in the most promising generalizations of the Standard Model: 


(1) 


(2) 


The fact that it enlarges the symmetry of nature means that it further restricts 


the allowed models, and thus makes stronger predictions. 


The greater symmetry also simplifies quantum calculations in many ways, es- 
pecially through the use of the concept of “superspace”. The results of these 


calculations are also often simplified. 


Because supersymmetric calculations are simpler, they can be used to simplify 


nonsupersymmetric calculations, at both the classical and quantum levels. 


This simplification in quantum rules results in improved high-energy behavior. In 
some cases it even results in the absence of the infinities in momentum integration 
that occur in all nonsupersymmetric theories. Although these infinities can be 
removed in perturbation theory, their effects reappear upon summation of the 
expansion. An analogy can be drawn with the elusive Higgs boson: It also has 


not been observed, but is needed to remove certain infinities. 


This improvement at high energies also improves the experimental agreement of 


Grand Unified Theories of the strong, electromagnetic, and weak interactions. 


. Algebra 


From quantum mechanics we know that for any operator A 


(ILA, ATH d) = DU((WIAln) (n| At) + (| Atm) (n| Aly) 


n 


= S((njAlwy P+ [(nfAlv)?) > 0 
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from inserting a complete set of states. In particular, 


{A,AN}=0 3 A=0 


from examining the matrix element for all states |q). This means the anticommuta- 
tion relations of the supersymmetry generators must be nontrivial. 
We are then led to anticommutation relations of the form, in Dirac (Majorana) 
notation, 
{ag¢=b or {a,g'} = p"7av'2%0 
(We use translations instead of internal symmetry or Lorentz generators because of 
dimensional analysis: Bosonic fields differ in dimension from fermionic ones by half 


integers.) Note that this implies the positivity of the energy: 
tr{q,q'} = V2p* tr(ya0) = V2p* tra, Yoh) = Petr I 


Similar arguments imply that the supersymmetry generators are constrained, just 
as the momentum is constrained by the mass-shell condition. For example, in the 


massless case, 


{pq, DP} = ppp = —3p°p=0 => pq=0 


In four dimensions the commutation relations can be written in terms of irre- 


ducible spinors as 
{o, P}=n", {ag} ={7q =0 


This generalizes straightforwardly to more than one spinor, carrying a U(N) index: 
{ai*, P} = dip? 


Exercise IIC1.1 
Show positivity of energy in 2-component spinor notation for 4D U(N) super- 


symmetry. 


2. Supercoordinates 


Since the momentum is usually represented as coordinate derivatives, we natu- 
rally look for a similar representation for supersymmetry. We therefore introduce 
an anticommuting spinor coordinate 6°. Because of the anticommutation relations q 


can’t be simply 0/00, but the modification is obvious: 
0. 24 rg oO a Cm ec 
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We can also express supersymmetry in terms of its action on the “supercoordinates” : 


Using the hermitian infinitesimal generator €°qq + qa, 
SOPH, OH, FF = L(G? + CO) 
Note that (q*)' = q*, (qa)! = —Gg. 


We can also define “covariant derivatives”: derivatives that (anti)commute with 


(are invariant under) supersymmetry. These are easily found to be 


O er = 
dy = —— + 16° p ., oe oa 
age" 2 Pas age ? 
Besides overall normalization factors of 7, leading to the opposite hermiticity condition 
(d°)t = —d*, these differ from the q’s by the relative sign of the two terms. These 


changes combine to preserve 
{d°,d?}=p™, — {d,d} = {dd} =0 


as a result of which p is also a covariant derivative as well as being a symmetry 


generator (as for the Poincaré group), but now (d.)' = +dg. 


In classical mechanics, the fact that 0/Ox commutes with translations is “dual” to 
the fact that the infinitesimal change dz, or the finite change x — 2’, is also invariant 
under translations. Furthermore, the d’Alembertian D1 = (0/0x)* being Poincaré 
invariant is dual to the line element ds? = —(dx)? being invariant. This allows the 
construction of the action from #7. In the supersymmetric case the infinitesimal 


invariants under the q’s (and therefore p) are 
do’, — d0*, — dx” + 24(d0°) 0" + 4i(db?)0" 
and the corresponding finite ones (by integration) are 
peg, GA Fg _ glob + Ligagih + 1;G8 gra 


Although these can be used to construct classical mechanics actions, their quantiza- 
tion is rather complicated. Just as for particles of one particular spin, direct treatment 
of the quantum mechanics has proven much simpler than deriving it by quantization 


of a classical system. 


Exercise IITC2.1 
Check explicitly the invariance of the above infinitesimal and finite differences 
under supersymmetry. 

Now that we have a (super)coordinate representation of the supersymmetry gen- 


erators, we can examine the wave functions/fields that carry this representation. Such 
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“superfields” can be Taylor expanded in the @’s with a finite number of terms, with 
ordinary fields as the coefficients. For example, if we expand a real (hermitian) scalar 


superfield 
B(x, 0,0) = O(2) + 0% Wo (2) + Oba (az) +... 


and also expand its supersymmetry transformation 
5B = Yq + Chg + HieOO, 5h + Fic OP Opgad + ... 
we find the component field transformations 


5p = Yat Edy, Wa = hie Opb+..., Sg = —Hie*Opah t ..., 


(e7 


which mix the different spins. 


An alternative, and more convenient, way to define the 6 expansion is by use of 


the covariant derivatives. Using “|” to mean “go”, we can define 


o= d|, Ya = (d.®)|, Wa _ (dg®)|, 


There is some ambiguity at higher orders in 6 because the d’s don’t anticommute, and 
this can be resolved according to whatever is convenient for the particular problem, 
avoiding field redefinitions in terms of fields appearing at lower order in @: Since 
the field equations must be covariant under supersymmetry (otherwise there is no 
advantage to using superfields), they must be written with the covariant derivatives. 
Then one defines the component expansions by choosing the same ordering of d’s as 
appear in the field equations (where relevant), which gives the component expansion 
of the field equations the simplest form. It also gives a convenient method for deriving 


supersymmetry transformations, since the d’s anticommute with the q’s: 
6|(d...d®)|| = [d...d(d@)]| = [d...d(ieq@)]| = |(ieq)d...d®]| = [(ed)d...dG]| 


where we have used the fact that g = —id + 0-stuff, where the 6-stuff is killed by 
evaluating at @ = 0, once it has been pulled in front of all the @-derivatives. Covariant 
derivatives can also be used for integration, since [d? = 0/00 = d up to an a- 


derivative, which can be dropped when also integrating [ dz. 
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3. Supergroups 


We saw certain relations between the lower-dimensional classical groups that 
turned out to be useful for just the cases of physical interest of rotational (SO(D—1)), 
Lorentz (SO(D—1,1)), and conformal (SO(D,2)) groups. In particular, the Poincaré 
group, though not a classical group, is a certain limit (“contraction”) of the groups 
SO(D,1) and SO(D—1,2), and a subgroup of the conformal group. Similar remarks 
apply to supersymmetry, but because of its relation to spinors, these classical “su- 
pergroups” (or “graded” classical groups) exist only for certain lower dimensions, the 
same as those where covering groups for the orthogonal groups exist. In higher di- 
mensions the supergroups do not correspond to supersymmetry, at least not in any 


way that can be represented on physical states. 


We'll consider only the graded generalization of the classical groups that appear 
in the bosonic case. The basic idea is then to take the group metrics and combine 


them in ways that take into account the difference in symmetry between bosons and 


fermions: 
Unitary: ys 
OrthoSymplectic: M48 
Real: n ra 
pseudoreal (*): Q,? 


where 7 is symmetric and {2 antisymmetric, as before, while M/ is graded symmetric: 


For A = (a, a) with bosonic indices a and fermionic ones a, 
M4) —0: M®— mM = M%* — Meo — me? + Mee — 9 
Again we have inverse metrics, e.g., 
M"' Mx, = 64 


With respect to the usual index-contraction convention (no extra grading signs when 
superscript is contracted with subscript immediately following), we should take the 


ordering of indices on 6 as 6,1. 


There is no analog of the € tensor, at least for finite-dimensional groups, since it 
would have an infinite number of indices when totally symmetric. However, “special” 
supergroups can still be defined by generalizing the definition of trace and determinant 


to supermatrices. One convenient way to do this is by using Gaussian integrals, since 
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this is a common way that such expressions will arise. As a generalization of the 


bosonic and fermionic identities we therefore define the “superdeterminant” 
(sdet M)"'=N > dt dz e7'™ 


where “N” is a normalization factor defined so sdet I = 1. By explicitly evaluating 
the integral, separating out the commuting and anticommuting parts, we find (see 
exercise [B3.4) 


sas (< ) _ det A _ det(A — BD~!C) 
C D det(D — CA-'B) det D 
where here A and D contain only bosonic elements, while B and C' are completely 
fermionic. 
Exercise IIC3.1 
Generalize exercise IB3.4 to superdeterminants: Divide up the range of a 
square matrix into two (not necessarily equal) parts, where each of the two 
parts may include indices of both fermionic and bosonic grading, so the four 
resulting blocks in the matrix may each include both commuting and anti- 


commuting elements. Show that 
A B 
sdet & 5) = sdet D- sdet(A — BD~'C) = sdet A- sdet(D — CA™'B) 


by integration. 

The “supertrace” (see also exercise [A2.3c) then can be defined by generalizing 

the bosonic identity det(e“) = e"™: 
sdet(e™) —_ estr M 
str(M,4®) = (-1)}4M,4 = M,* — M,* =tr A—tr D 

follows, as in the bosonic case, from 6 In sdet M = str(M~16M), which is derived by 
varying the Gaussian definition. 

Exercise IIC3.2 


Show that for graded matrices we need to use str (and not tr) for the identity 
str(MN) = str(NM) 
A useful identity for superdeterminants can be derived by starting with the fol- 


lowing identity for the inverse of a matrix for which the range of the indices has been 


divided into two pieces: 


(= (Oa aa) 


162 II. SPIN 


We have assumed all the submatrices are square and invertible; equivalent expressions, 
which are more useful in other cases, can be derived easily by multiplying and dividing 


by the submatrices: For example, 


i ed eres 


From either of these we immediately see 


A By A B A B 2 
( ) -(¢ ; | — sact ( ) = det A det P= 
C D C D C D det D det A 


The graded generalizations of the classical groups are then 
GL(m|n,C) [SL(m|n,C),SSL(n|n,C)} 


U: [S]U(m,,m_|n) [SSU(n;,n_|n4+n_)] 
OSp: OSp(m|2n,C) 

R: GL(m|n) [SL(m|n),SSL(n|n)] 

*: [SJU*(2m|2n) [SSU*(2n|2n)] 


U & OSp 

R: OSp(m,,m_|2n) 

*: OSp*(2m|2n) 
where “(m|n)” refers to m bosonic and n fermionic indices, or vice versa. In the 
matrices of the defining representation, the elements with one bosonic index and one 
fermionic are anticommuting numbers, while those with both indices of the same kind 


are commuting. In particular, the commuting parts give the bosonic subgroups: 


GL(m|n,C) D GL(m,C)®GL(n,C) 
SL(mjn,C) D GL(m,C)@SL(n,C) 
SSL(n|n,C) D SL(n,C)@SL(n,C) 
U(m,,m_|n) D U(m,,m_)@U(n) 
SU(m;y,m_|n) D U(m4,m_)@SU(n) 
SSU(n,,n_|nyt+n_) D SU(n,,n_)@SU(n,+n_) 
OSp(m|2n,C) D SO(m,C)®@Sp(2n,C) 
GL(mjn) D GL(m)@GL(n) 
SL(m|n) D GL(m)@SL(n) 
SSL(n|n) D SL(n)@SL(n) 
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U*(2m|2n) D U*(2m)@U*(2n) 
SU*(2m|2n) D U*(2m)@SU*(2n) 
SSU*(2n|2n) D SU*(2n)@SU*(2n) 
OSp(m,,m_|2n) D SO(m,,m_)@Sp(2n) 
OSp*(2m|2n) D SO*(2m)@USp(2n) 


When the commuting and anticommuting dimensions are equal, we can impose trace- 
lessness conditions on both bosonic parts of the generators separately (“SS”, also 
called “PS”: tr A=tr D=0). This is related to the fact str(/) = 0 in such cases. 


4. Superconformal 


Since the conformal group is a classical group, its supersymmetric generalization 
should be a classical supergroup. Because the fermionic generators must include the 
supersymmetry generators, which are spinors, the representation of the conformal 
group that appears in the defining representation of the supergroup must be the spinor 
representation. However, we have seen that only for n<6 (where covering groups exist) 
and n=8 (where the spinor of SO(8) is another of its defining representations) can 
the spinor representation of SO(n) be defined by classical group restrictions. This 


implies that the superconformal group exists only in D<4 and D=6. 


The relevant supergroups can be identified easily by looking at the bosonic sub- 


groups: 
D=3: OSp(NIA) 
4: §U(2,2|N) (or SSU(2,2/4)) 
6: OSp*(8|2N) 


(We consider only D>2, since the conformal group is infinite-dimensional in D<2.) 
These three cases of D=3,4,6 are special for a number of reasons: In particular, 
these three supergroups can be related to SU(N|4) over the division algebras: the 
real numbers, complex numbers, and quaternions, respectively. (Similar remarks 
apply to their important classical bosonic subgroups: the conformal, Lorentz, and 
rotation groups. Attempts have been made to extend these results to the octonions 
for D=10, but with less success, and there seems to be no superconformal group for 
that case.) However, just as in the case of the Hilbert space of quantum mechanics, 
the complex numbers seems to be the best of these “division algebras”, having the 


analytic properties the real numbers lack, while avoiding the noncommutativity of 
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the quaternions. We’ll see later that nontrivial interacting (local, classical) conformal 
field theories exist only in D<4. 


For example, for D=4 we find that the bosonic generators are the conformal group 
and the internal symmetry group U(N) (or SU(4) for N=4), while the fermionic gener- 
ators include supersymmetry (N spinors) and its fraternal twin, “S-supersymmetry”. 
As supersymmetry is the “square root” of translations, so S-supersymmetry is the 


square-root of conformal boosts. 


Exercise IIC4.1 
For D=4, write the (graded) commutation relations of the superconformal 
generators. Decompose them into representations of the Lorentz group, and 


find their commutation relations. 


5. Supertwistors 


We saw that a simple way to find representations of SO(4,2) was to use the 
coordinate representation for SU(2,2): The resulting twistors gave all massless rep- 
resentations (p” = 0 for all helicities). This method generalizes straightforwardly to 


the superconformal groups: The generators are 
GeO 


(For the SU case we should also subtract out the trace, but that generator commutes 


with the rest anyway.) The coordinates and their conjugate momenta satisfy 
[Ca, C7} — aye 


C4 is then in the defining representation of the supergroup, while the wave function, 


which is a function of ¢, contains more general representations. 


For D=3, the reality condition sets ¢ = ¢, so the ¢’s are the graded generalization 
of Dirac y matrices. In fact, the anticommuting ¢’s are the y matrices of the SO(N) 
subgroup of the OSp(N]4). On the other hand, the commuting ¢’s carry the index of 
the defining representation of Sp(4), so they are a spinor of SO(3,2), the 3D conformal 
group: They are the bosonic twistor, and can be used in a similar way to the 4D 


twistors discussed earlier. 


For D=4, there is a U(1) symmetry acting on ¢ under which G4? is invariant, 


generated by (—1)4G 44, as in the bosonic case: This is the “superhelicity”. 


For D=6, ¢ is pseudoreal. In general, for pseudoreal representations of groups it is 


often convenient to introduce a new SU(2) under which the pseudoreal representation 
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¢4 and its equivalent complex conjugate representation ¢ BQ oa transform as a doublet 
(SU(2) spinor). This is also obvious from construction, since half of the components 


are related to the complex conjugate of the other half. We then can write 
co = (ce iC?.2,4) _ CBk EA. Qa" = Oec™ MaAtBE _ M48 Ctr 


Ga? = OFCai 
(Thus, OSp*(2m|2n)COSp(4n|4m), and SO*(2m)CSp(4m), USp(2n)CSO(4n).) This 


means there is now an SU(2) symmetry on ¢, generated by 
Gig = CP aC as) 


under which G4? is invariant. This is the 6D version of superhelicity. In the D=6 
light cone, the manifest part of Lorentz invariance is SO(D—2)=SO(4)=SU(2)@SU(2). 
This is one of those SU(2)’s. 


We now concentrate on D=4 (although our methods generalize straightforwardly 
to D=3 and 6). The simplest way to find (massless) representations of 4D super- 
symmetry is to generalize the Penrose transform. Just as twistors automatically 
satisfy the massless field equations in D=4, supertwistors automatically satisfy their 
supersymmetric generalization. The supertwistor is the defining representation of 
SU(2,2|N). The SU(2,2) part is the usual twistor, while the SU(N) part is the usual 
fermionic creation and annihilation operators for SU(N). Thus, to relate superspace 


to supertwistors, we write 


Pop = 10.3 > £PoPs 
dia = —10 i + 50° 0,5 — +4;Po, ds _ —id', + 50" Aga, — +alp. 


This determines the Penrose transform from superspace to supertwistors: 


p= (0? — 60? paps + OaiPe 
where we have used “chiral superfields” (trivial dependence on 6, via the constraint 
di. = 0) without loss of generality. (Instead of treating a; as coordinates to be 
integrated, we can also treat them as operators; we then make the y’s functions of a’, 
and replace the integration with vacuum evaluation (0| |0).) As for ordinary twistors, 
this result can be related to the lightcone: For given momentum, we can choose the 
lightcone frame p° = 03; then gq; = ta;, while g? = 0 is a result of the supertwistor 


formalism automatically incorporating pq = 0. 
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Exercise IIC5.1 
Find the Penrose transform for D=3. (Warning: The anticommuting part 
of the twistor is now like Dirac matrices rather than creation/annihilation 


operators. ) 


Taylor expanding in a; (and thus 6’°, producing terms antisymmetric in i...7 and 
symmetric in a...3), the states then carry the index structure ¢, ¢;, ;;, .--; é', d, totally 


antisymmetric, and terminating with another singlet, where 


ees 7 oo 
p= He’ ba.--in P= a bin-ins 


From our discussion of helicity in subsection IIB7, we see that the states also decrease 
in helicity by 1/2 for each a (i.e., ignoring 0, each 6** comes with a p,, simply because 
it adds an undotted index). Taking the direct product with any helicity (coming from 
the explicit p,’s and p,z’s carrying the external Lorentz indices), we see that the states 
have helicity h,h —1/2,h —1,...,h — N/2, with multiplicty ae for helicity h — n/2: 


| state | helicity (Poincaré) | multiplicity [SU(N)] 
$ h i 


This multiplet structure is carried separately by y, and by y_, which are related 
by charge (complex) conjugation, one describing the antiparticles of the other, as for 
ordinary twistors. (The existence of both multiplets also follows from CPT invariance, 
which is required for local actions, to be discussed in subsection IVB1. Here we 
generalized from the Penrose transform, which contained both terms as a consequence 
of being the most general solution to S“p, + wp* = 0, which is CPT invariant.) 
Because of the values of the helicities, we can impose a reality condition, identifying 
all states with helicity 7 as the complex conjugates of those with —j, only for —h = 
h—N/2—-h= N/4, when N is a multiple of 4. We can also get larger representations 


by taking the direct product of these smallest representations of supersymmetry with 
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representations of U(N), in which case the fields will carry those additional SU(N) 


indices. 


Exercise IITC5.2 


Let’s examine these 4D multiplets in detail: 


a List the SU(N) representations for each allowed value of N for each of the 


cases where the helicity |h| < 1/2 (“scalar multiplets”), 1 (“vector multi- 


plets”), 3/2 (“gravitino multiplets”), or 2 (“graviton multiplets”), assuming 


the maximum-helicity state is a singlet. 


b Show that “supergravity” (graviton multiplets) 


can exist only for N<8. Show 


that the relevant representation for N=8, if real, is the same as the one (com- 


plex) for N=7. 


c Find the analogous statements for “super Yang-Mills” (vector multiplets). 


The explicit form of the reality condition is somewhat complicated in terms of 


the chiral superfields, because they are really field strengths of real gauge fields. 


(Consider, for example, expressing reality of A.p in terms of fag in the case of elec- 


tromagnetism.) However, in terms of the twistor variables, charge conjugation can 


be expressed as 


C : X47 x=", ai 


where the transformation on a; is required because it 


(a;)! 
carries the SU(N) “charge”. 


Since this violates “chirality” in these variables (dependence on a and not a’), it is 


accomplished by Fourier transformation: 


C: X+4(a;) > c | aa ealai he 


for some “charge conjugation matrix” C’ (in case the fiel 
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In the previous chapters we considered symmetries acting on coordinates or wave 
functions. For the most part, the transformations we considered had constant param- 
eters: They were “global” transformations. In this chapter we will consider mostly 
field theory. Since fields are functions of spacetime, it will be natural to consider 
transformations whose parameters are also functions of spacetime, especially those 
that are localized in some small region. Such “local” or “gauge” transformations are 


fundamental in defining the theories that describe the fundamental interactions. 


Pa Neehaiebieseaters Sap. Oe id |), '). ee anenen mene irene es 


A fundamental concept in physics, of as great importance as symmetry, is the 
action principle. In quantum physics the dynamics is necessarily formulated in terms 
of an action (in the path-integral approach), or an equivalent Hamiltonian (in the 
Heisenberg and Schrédinger approaches). Action principles are also convenient and 
powerful for classical physics, allowing all field equations to be derived from a single 


function, and making symmetries simpler to check. 


1. General 


We begin with some general properties of actions. (For this subsection we’ll re- 
strict ourselves to bosonic variables; however, in the following subsection we’ll find 
that the only modification for fermions is a more careful treatment of signs.) Gen- 
erally, equations of motion are derived from actions by setting their variation with 


respect to their arguments to vanish: 


6S|9] = S[¢ + 66] — S[d] = 9 


The solutions to this equation (find ¢, given S') are “extrema” of the action; generally 
we want them to be minima, corresponding to minima of the energy, so that they will 
be stable under small perturbations. 
Exercise IITA1.1 
Often continuous coordinates are replaced with discrete ones, for calculational 
or conceptual purposes. Consider 


(oe) 


S=- S> 3 (dn+1— an)? 


n=—coO 


The integer n is interpreted as a discrete time, in terms of some “small” unit. 
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a Show that 
6S=O => qdnii— 24n+ Qn_-1 = 0 


b Examine the continuum limit of the action and equations of motion: Introduce 


appropriate factors of €, with t = ne, and take the limit € — 0. 


Now we take the variables ¢ to be functions of time; thus, S is a function of 
functions, a “functional”. It just means that S is a function of an infinite set of 
variables. We can generalize properties of ordinary functions (derivatives, etc.) as 


usual by considering discrete time and taking a continuum limit: 
i#=1,2,.. — t€[-00, co] 


% — ¢(t) 


3 ju 


bis _ o(t =f 5 
we ey ae 
06; do(t) 


few > [row 


(the last, a “functional integral”, will appear in quantum theory) where 6,; is the 
usual Kronecker delta function, while 6(t — t’) is the “Dirac delta function”. It’s not 
really a function, since it takes only the values 0 or oo, but a “distribution”, meaning 


it’s defined only by integration: 


[et senate) = 10) 
Of course, the variable ¢(t) can also carry an index (or indices). In field theory, it 
will also be a function of more coordinates, those of space. 


For example, making these substitutions into the definition of a (partial) deriva- 


tive to get a “functional derivative” , 


fd) _y FO +6y)- FG) _, S16) _,, fb) +6(t-2)] - Fl) 
00; «0 € do(t’) <0 c 
Sometimes the functional derivative is defined in terms of that of the variable itself: 
dp(t) j 
Tall) = 6(t— 1’) 


If we apply this definition of the Dirac 6 to 6¢/6¢, we obtain the previous definition of 
the functional derivative. (Consider, e.g., varying S = f dt f(t)d(t) for a fixed func- 


tion f.) However, in practice we never need to use these definitions of the functional 
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derivative: The only thing for which we need a functional derivative is the action, 


whose functional derivative is defined by its variation, 


I5|6] = Slo + 50] — slo] = [at 69S 


(The fact that the variation can always be written in this form is just the statement 


that it is linear in 6¢, since d¢ is “infinitesimal” .) 


A general principle of mechanics is “locality”, that events at one time directly 
affect only those events an infinitesimal time away. (In field theory these events can 
be also only an infinitesimal distance away in space.) This means that the action can 


be expressed in terms of a Lagrangian: 


sie] = | at Lot) 


where L at time t is a function of only ¢(t) and a finite number of its derivatives. For 
more subtle reasons, this number of time derivatives is restricted to be no more than 
two for any term in L; after integration by parts, each derivative acts on a different 


factor of ¢. The general form of the action is then 


L() a —16'" Gmn() os o” Am(¢) P U(¢) 


8 9 


where means O/Ot, and the “metric” g, “vector potential” A, and “scalar po- 
tential” U are not to be varied independently when deriving the equations of motion. 
(Specifically, 6bU = (6¢™")(0U/06™), etc. Note that our definition of the Lagrangian 
differs in sign from the usual: Thus, for a particle with kinetic energy T in a potential 
U with energy H =7T+U we have L = —T'+U.) The equations of motion following 
from varying an action that can be written in terms of a Lagrangian are 

55 8. 
og” og” 
where we have eliminated 56” terms by integration by parts (assuming 6¢ = 0 at 
the boundaries in t), and used the fact that 6¢(t) is arbitrary at each value of t. For 


example, 


0-55 = | arson 


s=- fatte > 0=6S = fat goa = f at (6a) > —=%=0 


Exercise IITA1.2 
Find the equations of motion for ¢” from the above general action in terms 
of the external fields g, A, and U (and their partial derivatives with respect 
to ¢). 
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Locality applies only to the classical action; in quantum field theory we will also 
find “effective actions” that include nonlocal contributions from quantum effects. 
Similar effects can appear in classical theories; for example, electrodynamics in the 
Coulomb gauge includes a (spatially) nonlocal Coulomb interaction term. The in- 
terpretation is always that some quantity has been eliminated, which would return 
locality (e.g., the “longitudinal photon” in the Coulomb gauge). Such actions can 
still be varied by the same methods as above. However, one should always avoid the 
rule 0OL/0¢ = O,(0L/0;¢), since (1) it applies only to actions that can be expressed 
in terms of just @ and ¢ (and not higher derivatives nor nonlocalities), and (2) it 


arbitrarily separates terms into two sets. 


Such actions can be reduced to ones that are only linear in time derivatives 
by introducing additional variables. First, separate out the subspace where g is 


invertible, with coordinates q (¢” = (q',r“)); the Lagrangian is then written as 
L(q,r) = 344 9ig(a,7) + FAG.) +7 A,(G7) +U(G,7) 

This Lagrangian gives equivalent equations of motion to 
L'(q,p,7) = [-@'pi + Ay] + [59° (Pi + Ai) (D5 + Aj) + UI 


where g“! is the inverse of g;;. (Many other forms are possible by redefinitions of p.) 
Eliminating the new variables p by their equations of motion gives back L(q,r). Note 
that this works only because p’s equations of motion are algebraic: For example, 
eliminating z from the Lagrangian —ép + 4p? by the equation of motion ¢ = p is 
illegal (it would give the trivial action S = — f dt sp”), since it would require solving 
for the time dependence of x. On the other hand, p is given explicitly in terms of 
the other variables by its equations of motion without inverting time derivatives, so 


eliminating it does not lose any of the dynamics. (It is an “auxiliary variable” .) 


The result is a Hamiltonian form of the Lagrangian: 
Li(®) = ib” Ay () + (6) 
in terms of the Hamiltonian H, where & = (q,p,r). It has the “gauge invariance” 
dAu = Oy A(G) 


(where 04 = 0/06™), since that adds only a total derivative term iA. Clearly A 
will introduce a modification of the Poisson bracket if it is not linear in @ (e.g., as 


when we make independent nonlinear redefinitions of coordinates and momenta on 
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the usual form of the Lagrangian). To determine this modification we compare the 


equation of motion as defined by a Poisson bracket, 
b™ = —i[6”, H] = -i[6", © OyH 
with that following from varying the action, 
—i®\ Fy + Oy H = 0, Fun = OmAn 
to find 
io”, 6) = (poe 
where “F~!” is the inverse on the maximal subspace where F is invertible. The 
variables in the directions where F vanishes are “auxiliary”, since they appear without 
time derivatives: Their equations of motion are not described by the Poisson bracket. 


In particular, if they appear linearly in H they are “Lagrange multipliers”, whose 


variation imposes algebraic constraints on the rest of @. 
Finally, we can make redefinitions of the part of ® describing the invertible sub- 
space so that A is linear: 


Amu = 32% Qnu => Ly(®) = sib" O* On + H(O) 


where 2 is a constant, hermitian, antisymmetric (and thus imaginary) matrix. For 
some purposes it is more convenient to assume this Hamiltonian form of the action 


as a starting point. We now have the canonical commutation relations as 


where 2” is the inverse of Quy on the maximal subspace: 


OY Dax = lip” 


for the projection operator JT for that subspace. 


Exercise IIITA1.3 

For electromagnetism, define o =E+iB. 

a Show that Maxwell’s equations (in empty space) can be written as two equa- 
tions in terms of w. 

b Interpret the equation involving the time derivative as a Schrodinger equation 
for the wave function w, and find the Hamiltonian operator. 

c Define the obvious inner product f d?x w* -: What physical conserved 
quantity does this represent? (Note that, unlike electrons, the number of 


photons is not conserved.) 
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Note that the requirement of the existence of a Hamiltonian formulation deter- 
mines that the kinetic term for a particle in the Lagrangian formulation go as <7 
and not 7%. Although such terms give the same equations of motion, they are not 
equivalent quantum mechanically, where boundary terms (dropped when using inte- 
gration by parts for deriving the equations of motion) contribute. Furthermore, the 


Hamiltonian form of the action 
Sa f dt de, 


shows that the energy H relates to the time in the same way the momentum relates to 
the coordinates, except for an interesting minus sign that is explained only by special 


relativity. 


2. Fermions 


In nonrelativistic quantum mechanics, spin is usually treated as a quantum effect, 
rather than being derived from classical mechanics. Although it is possible to derive 
spin from classical mechanics, in general it is rather cumbersome, and involves first 
introducing a large number of spins and then constraining away all the undesired ones, 
whereas in the quantum mechanics one can just directly introduce some particular 
representation of the spin angular momentum operators. The one nontrivial exception 
is spin 1/2. 

We know from quantum mechanics that the spin variables for spin 1/2 are de- 
scribed by the Pauli o matrices. Since they satisfy anticommutation relations, and 
are represented by finite-dimensional matrices, they are interpreted as fermionic. We 
have already seen that classical fermions are described by anticommuting numbers, 


so we begin by considering general quantization of such objects. 


We can now consider actions that depend on both commuting and anticommuting 
classical variables, 6” = (6, Ww"), where now ¢ refers to the bosonic variables and 7 
to the fermionic ones. The Hamiltonian form of the Lagrangian can again be written 
as 

Lu(®) = HOY“ ON On + H(G) 


When 2 is invertible, the graded bracket is defined by (see subsection IA2) 


[o™, pr = aie OY aig = oy 
To describe spin 1/2, we therefore look for particle actions of the form 
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This corresponds to using 
BM = (bb) = (OP) = (CoP) 


Lr = Dies p _ Oi ads Quy — 24 = 6 Quix = Qn = 0 


ag 


The fundamental commutation relations are then 
[x*, pj] = thd}, {b',w}=ho — ([, 2] = [p, p] = [2,4] = [p, d] = 0) 


We recognize 7" as the Pauli o matrices (the Dirac matrices of subsection IC1 for 
the special case of SO(3), normalized as in subsection ILA1), W' = Vho’. The free 


Hamiltonian is just 


p 


~ 2m 
as for spin 0: Spin does not affect the motion of free particles. 


A more interesting case is coupling to electromagnetism: Quantum mechanically, 


the Hamiltonian can be written in the simple form 


_ (ip. + ao)? 


mh 


H — gA°(z) 
in terms of the vector and scalar potentials A‘ and A®°. The classical expression is not 
as simple, because the commutation relations must be used to cancel the 1/h before 


taking the classical limit. This is an example of “minimal coupling”, 
H(p;) > H(p; + gAi) — A” 


However, this prescription works only if H for spin 1/2 is written in the above form: 
Using the commutation relations before or after minimal coupling gives different re- 
sults. The form we have used is justified only by considering the nonrelativistic limit 


of the relativistic theory. 


Exercise ITITA2.1 
Use the multipication rules of the o matrices to show that the quantum me- 
chanical Hamiltonian for spin 1/2 in an electromagnetic field can be written 
as a spin-independent piece, identical to the spin-0 Hamiltonian, plus a term 


coupling the spin to the magnetic field. 
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3. Fields 


Actions for field theories are just a special case (not a generalization) of the 
actions we have just considered: We just treat spatial coordinates 7 as part of the 


indices carried by the variables appearing in the action. In the notation used above, 


M = (i, 2) 
}™ (t) = G'(t, Z) 


Then spatial derivatives are just certain matrices with respect to the M index, [ dz 


comes from summation over M, etc. 


The field equations for all field theories (e.g., electromagnetism) are wave equa- 
tions. Wave equations also follow from mechanics upon quantization. Although 
classical field theory and quantum mechanics are not equivalent in their physical in- 
terpretation, they are mathematically equivalent in that they have identical wave 
equations. This is true not only for the free theories, but also for particles in external 
fields, and without direct self-interactions. This is no accident: Classical field theory 
and classical mechanics are two different limits of quantum field theory. They are 
both called classical limits, and written as h — 0, but since fh is really 1, this limit 


depends on how one inserts h’s into the quantum field theory action. 


The wave equation in quantum mechanics is the Schrodinger equation. The cor- 
responding field theory action is then simply the one that gives this wave equation 


as the equation of motion, where the wave function is replaced with the field: 


As usual (cf. electromagnetism), the field is a function of space and time; thus, we 
integrate d4x = dt d°x over the three space and one time dimensions. The Hamilto- 
nian is some function of coordinates and momenta, with the replacement p; — —70,, 


where 0; = 0/Ox' are the space derivatives and 0; = 0/dt is the time derivative. 


The Hamiltonian can contain coupling to other fields. For a general Hamilto- 
nian quadratic in momenta, in a notation implied by the corresponding Lagrangian 


quadratic in time derivatives, 


where g’’, A;, and U are now interpreted as fields, and thus depend on both 2‘ and t, 
as does w. In the case g’? = 6, we can identify A; and U as the three-vector and scalar 


potentials of electromagnetism, and we can add the usual action for electromagnetism 
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to the action for w. The action then can be varied also with respect to A and U to 
obtain Maxwell’s equations with a current in terms of wy and w*. We can also treat 
g) as a field, in which case it and parts of A and U are the components of the 


gravitational field. 


Field theory actions can be quantized in the same ways as mechanics ones. In 
this case, we recognize the 7*y) term as a special case of the 62 term in the 
generic Hamiltonian form of the action discussed earlier. Thus, ~(x’) and 7*(2;) 
have replaced x’ and p; as the variables; x’ is now just an index (label) on w and y*, 
just as 7 was an index on 2’ and p’. The field-theory Hamiltonian is then identified 


as 
Helv w'|= fon, n= Hy 
In field theory the Hamiltonian will always be a space integral of a “Hamiltonian 


density” H. 
The classical limit of a quantum theory defined by a classical action S can be 
defined as follows: Introduce h into the theory by replacing 


S3n's 


This has no effect on the classical equations of motion, but it introduces h into the 
Poisson bracket: i j 
—qp+H — -—=§p+=—H 


h h 
1 ; d O ol 
=e zal =%, Hn Det lt 


We can then recognize the limit h — 0 as the classical limit. In the quantum theory, 


it is equivalent to replacing 
p— —thd,, id, — H — iho, — H 


i.e., all derivatives get a factor of h. (More details will be possible when we consider 
quantization in subsection VA2.) However, a quantum theory can often be described 
by more than one classical action: This is known as “duality” (between any two such 


actions). 


In particular, any free quantum field theory, and many interacting ones, can be 
described by both a classical mechanics action and by a classical field theory action: 
This is the well-known “wave-particle duality”. We have just seen the standard 
nonrelativistic example. Furthermore, since we know the direct relation between the 


two actions in terms of the mechanics Hamiltonian H, we can describe both classical 
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limits directly in terms of just the field theory action. The classical field theory limit 
is defined by inserting h only as 


St = Ao" Sip 
On the other hand, if we put in h’s only as 
0; = ho;, O; => ho; 


which gives the usual h dependence associated with the Schrodinger equation, then 
the classical limit h — 0 gives classical mechanics. This defines classical mechanics 


as the macroscopic limit, the limit of large distances and times. 


A convenient way to implement this limit is to introduce the mechanics action 
S = f[ dt(—a'p; + H) into the field theory, and then take the limit h — 0 after the 


replacement 


SA's 


on the mechanics action instead of on the derivatives. The mechanics action can be 
introduced when solving the field equations: The solution to the wave equation can 
be expressed in terms of the propagator, which in turn can be written in terms of the 


mechanics action or Hamiltonian. 


Usually h is introduced through dimensional analysis; it has dimensions of action, 
whose dimension is defined by conservation laws vs. symmetries (see subsection IIB4) 
as energy X time or momentum x length. But the various parameters in the action 
may be assigned different dimensions in mechanics and field theory; for example, in 
classical mechanics one has p?+m?, while in classical field theory one has -O+m?, and 
these m’s differ by a factor of h in quantum field theory, but are unrelated classically, 
where one has dimensions of mass and the other of inverse length. (Consider, e.g., 
coupling a classical, massive particle to a classical, massive field.) So it’s only how 
you scale these constants (mass, charge, etc.) as h — 0 that gives different classical 
limits. Since field theory actions can have fields rescaled to put the inverse coupling 
in front of the action, the classical field theory limit is generally the limit of weak 
coupling. On the other hand, the classical mechanics limit, being macroscopic, is the 
limit of large mass; it’s also a strong coupling limit, as the action coming from a force 
between particles is proportional to the coupling, and gets a 1/h. (For example, the 
familiar fine structure constant e?/h of quantum mechanics is e?f in quantum field 


theory.) 
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More generally, we can define actions that are not restricted to be quadratic in 


any field. The Hamiltonian density H(t, x‘) or Lagrangian density L(t, x’), 


Sid] = ; dt dx Lid(t, x) 


should be a function of fields at that point, with only a finite number (usually no 
more than two) spacetime derivatives. This is the definition of locality used for gen- 
eral quantum systems in subsection IIIA1, but extended from derivatives in time to 
also those in space. Although this condition is not always used in nonrelativistic 
field theory (for example, when long-range interactions, such as Coulomb or gravita- 
tional, are described without attributing them to fields), it is crucial in relativistic 
field theory. For example, global symmetries lead by locality to local (current) conser- 
vation laws. Locality is also the reason that spacetime coordinates are so important: 
Translation invariance says that the position of the origin is an unphysical, redundant 


variable; however, locality is most easily used with this redundancy. 


Field equations are derived by the straightforward generalization of the variation 
of actions defined in subsection IITA1: As follows from treating the spatial coordinates 


in the same way as discrete indices, 


6S 
= tie A 
JS fe x Od (2) Soe, a) 
For example, 
s=— [dt de 30 => ca 
2 od 


Exercise ITIA3.1 


Consider the action 
s[é] = f dt a? [-46? + V(0) 


for potential V(¢) (a function, not a functional). 
a Find the field equations. 


b Assume V(¢) = Ad” for some positive integer n and constant, dimensionless 
A, in units h = c= 1. Use dimensional analysis to relate n and D (of course, 


also a positive integer), and list all paired possibilities of (n, D). 
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4. Relativity 


Generalization to relativistic theories is straightforward, except for the fact that 
the Klein-Gordon equation is second-order in time derivatives; however, we are fa- 
miliar with such actions from nonrelativistic quantum mechanics. As usual, we need 
to check the sign of the terms in the action: Checking the positivity of the Hamil- 
tonian (i.e., the energy), we see from the general relation between the Lagrangian 
and Hamiltonian (subsection IIIA1) that the terms without time derivatives must be 


positive; the time-derivative terms are then determined by Lorentz covariance. 


At this point we introduce some normalizations and conventions that will prove 
convenient for Fourier transformation and other reasons to be explained later. When- 


ever D-dimensional integrations are involved (as should be clear from context), we 


(0 =a) = (2)P76(w— 2), (pp!) = (2)?75?(p— pt) 


use 


In particular, this normalization will be used in Green functions and actions. For 


example, these implicit 27’s appear in functional variations: 


Fs So 6(e') = 5(0-2' 


= | dzro 
os / sry 50a) 


The action for a real scalar is then 
S= pu i L = 4(0¢)? + V(¢) 


where V(@) > 0, and we now write L for the Lagrange density. In particular, V = 
+m*¢? for the free theory. The free field equation is then p? + m? = —-O +m? = 0, 
replacing the nonrelativistic —i0,+.H = 0. For a complex scalar, we replace ¢¢ > ¥x 
in both terms. 

We know from previous considerations (subsection IIB2) that the field equation 
for a free, massless, Dirac spinor is y-OW = 0. The generalization to the massive case 
(subsection IIB4) is obvious from various considerations, e.g., dimensional analysis; 
the action is 


o= fu (id + aa)Y 
in arbitrary dimensions, again using the notation @ = 7-0. In four dimensions, we 


can decompose the Dirac spinor into its two Weyl spinors (see subsection IITA6): 


L = WiD + e)U = (UH ara + PRO apa) + Bib Ra + Vibra) 
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For the case of the Majorana spinor, the 4D action reduces to that for a single Weyl 


spinor, 
S= f dz (io, g6" + Ha + Hva) 
Note that in our conventions ony = Tous (and similarly for the opposite indices, since 


= 7%). so that the time derivative term is always proportional to yt(—idp)w, as 


nonrelativistically (previous subsection). 


A scalar field must be complex to be charged (i.e., a representation of U(1)): 


From the gauge transformation 


we find the minimal coupling (for q = 1) 
5, = f de BIO +iA)XP + $m}? 


The electromagnetic current is then defined by varying the matter action with respect 


to the gauge field: 


208, oe it 
t= = (—130 + A)x 


This action is also invariant under charge conjugation 
C:y > x*, A-——-A 
which changes the sign of the charge, since y*’ = e~y*. 


Exercise IITA4.1 
Let’s consider the semiclassical interpretation of a charged particle as de- 


scribed by a complex scalar field w, with Lagrangian 
L = 3(|Vol? + m?| 1’) 
a Use the semiclassical expansion in h defined by 
VohO+iqA, w— Jpes/* 


Find the Lagrangian in terms of p and S (and the background field A), order- 
by-order in h (in this case, just h° and h?). 


b Take the semiclassical limit by dropping the h” term in L, to find 
L = p3[(—0S + qA)? + m?] 
Vary with respect to S and p to find the equations of motion. Defining 


p=-o0S 
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show that these field equations can be interpreted as the mass-shell condition 
and current conservation. Show that A couples to this current by varying L 
with respect to A. 

The spinor field also needs doubling for charge. (Actually, the doubling can be 
avoided in the massless case; however, problems show up at the quantum level, related 
to the fact that there is no charge conjugation transformation without doubling.) The 
gauge transformations are similar to the scalar case, and the action again follows from 
minimal coupling, to an action that has the global invariance (A = constant in the 


absence of A): 
eaeug, Ue =e UE 
S.= f de (O(—i0,5 + A308 + GR(—i0,5 -A,3)0% + Ay(UiUna + Hida) 
The current is found from varying with respect to A: 
J? = pepe — pave 
Charge conjugation 
Cif OUR, A—-—A 
(which commutes with Poincaré transformations) changes the sign of the charge and 


current. 


Exercise ITTA4.2 


Show that this action can be rewritten in Dirac notation as 
i. = fu WId— A+ Tv 
and find the action of the gauge transformation and charge conjugation on 
the Dirac spinor. 
As a last example, we consider the action for electromagnetism itself. As before, 


we have the gauge invariance and field strength 


A 0 +X 


f —. — 

aB — Avs a 
= _ - = 4 ; 

Fra. = 9a4Ags — Og5Aay = Casas + Crp fap, fap = 30(a4Apy" 

We can write the action for pure electromagnetism as 


Sa= uw sf fap = fe anf Fg = pu pa FO Fas 


dropping boundary terms, with the overall sign again determined by positivity of the 


Hamiltonian, where e is the electromagnetic coupling constant, i.e., the charge of the 
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proton. (Other normalizations can be used by rescaling A,5-) Maxwell’s equations 


follow from varying the action with a source term added: 
S=S4t+ pu APJ, 5 => 40° 9 faa — dey 


Exercise IIITA4.3 
By plugging in the appropriate expressions in terms of A, (and repeatedly 
integrating by parts), show that all of the above expressions for the electro- 


magnetism action can be written as 
Sa=- f de [A-DA + (0- A)?) 


Exercise IITA4.4 
Find all the field equations for all the fields, found from adding to S44 all the 
minimally coupled matter actions above. 
Having seen many of the standard examples of relativistic field theory actions, we 
now introduce one of the most important principles in field theory; unfortunately, it 


can be justified only at the quantum level (see chapter VII): 


Good ultraviolet behavior: All quantum field theories should have only couplings with 


nonnegative mass (engineering) dimension. 


(Here “couplings” means the coefficients of arbitrary terms, when the fields have been 


defined so that the massless parts of the kinetic terms have no coupling dependence.) 


Exercise IITA4.5 
Show in D=4 using dimensional analysis that this restriction on bosons ¢ and 


fermions w restricts terms in the action to be of the form 


0, 0°, 8G, G06, $06, 006; W?, YOY; oY 


and find the dimensions of all the corresponding coupling constants. 


The energy-momentum tensor for electromagnetism is much simpler in this spinor 
notation, and follows (up to normalization) from gauge invariance, dimensional anal- 
ysis, Lorentz invariance, and the vanishing of its trace. It has a form similar to that 


of the current in electrodynamics: 
-_ 1 f 
Tas3 = — atop lss 


Note that it is invariant under the duality transformations of subsection IIA7 (as is 


the electrodynamic current under chirality). 
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We have used conventions where e appears multiplying only the action Sy, and 


not in the “covariant derivative” 
V=0+iqA 


where q is the charge in units of e: e.g., ¢g = 1 for the proton, g = —1 for the electron. 


Alternatively, we can scale A, as a field redefinition, to produce the opposite situation: 
AeA: Sa f de BF? V—0O+igeA 


The former form, which we use unless noted otherwise, has the advantage that the 
coupling appears only in the one term Sy, while the latter has the advantage that the 
kinetic (free) term for A is normalized the same way as for scalars. The former form 
has the further advantage that e appears in the gauge transformations of none of the 
fields, making it clear that the group theory does not depend on the value of e. (This 


will be more important when generalizing to nonabelian groups in section IIIC.) 


Note that the massless parts of the kinetic (free) terms in these actions are scale 
invariant (in arbitrary dimensions, when the dimension-independent forms are used), 
when the fields are assigned the scale weights found from conformal arguments in 
subsection ITB2. 


Exercise IITIA4.6 
Using vector notation, minimal coupling, and dimensional analysis, find the 
mass dimensions of the electric charge e in arbitrary spacetime dimensions, 


and show it is dimensionless only in D = 4. 


An interesting distinction between gravity and electromagnetism is that static 
bodies always attract gravitationally, whereas electrically they repel if they are like 
and attract if they are opposite. This is a direct consequence of the fact that the 
graviton has spin 2 while the photon has spin 1: The Lagrangian for a field of integer 
spin s coupled to a current, in an appropriate gauge and the weak-field approximation, 
is 

L = —Fb7"™ Obay...an + 9 Ga1...0, 7 


for some coupling g, where the sign of the first term is fixed by unitarity in quantum 


field theory, or by positivity of the energy in classical field theory: 


Lo = —4 ieee Bl ee = AHo= le Dic a (0b) (Oi) ar...a6] 


(Time components of ¢@ are unphysical, arising from gauge fixing, and so should be 


ignored as far as arguments of unitarity or positivity of energy are concerned.) From 
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a scalar field in the semiclassical approximation (see exercise IIIA4.1 above), starting 
with 
Jus —_ w*(-4i0™) ee (—310%)w 
where 
AO B = AOB — (OA)B 


we see that the current will be of the form 


Jods — ppt... y's 
for a scalar particle, with “density” p. (The same follows from comparing the ex- 
pressions for currents and energy-momentum tensors for particles as in subsection 
IIIB4 below. The only way to get vector indices out of a scalar particle, to couple to 
the vector indices for the spin of the force field, is from momentum.) In the static 
approximation, only time components contribute: We then can write this Lagrangian 


as, taking into account 9 = —1, 


L= —(—1)°4¢0...00¢0...0 a +9%0...0P(p°)* 
where E = p® > 0 for a particle and < 0 for an antiparticle. Solving for ¢ by its field 
equation and plugging back in, we have 


Ss Ss 1 Ss 
EL = (1) 9° pE* = pE 


Since we’re looking at the static case, can be replaced with the Laplacian A, and 
the Lagrangian (density) is the same as the Hamiltonian (density), so the “potential 
energy” V produced by this interaction (we have neglected the “kinetic energy”, or 
pure w terms in the action) is, in D=4, 


da da’ 


V = —(-'30n9)4 | Sa VEO 


where we have used i ‘ 
—_ 63 ee ee ee ae 
A aa An|x — 2’ | 


in terms of the 3D distance |x — x’). 


Thus the spin-dependence of the potential/force 
between two particles goes as —(— FE £2)*. It then follows that all particles attract by 
forces mediated by even-spin particles, and a particle and its antiparticle attract under 
all forces, while repulsion will occur for odd-spin forces between two identical particles. 
(We can substitute “particles of the same-sign charge” for “identical particles”, and 
“particles of opposite-sign charge” for “particle and its antiparticle”, where the charge 


is the coupling constant appropriate for that force.) 
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Exercise ITIA4.7 


Show that the above current is conserved, 
a0 ap 


(and the same for the other indices, by symmetry) if w satisfies the free Klein- 


Gordon equation (massless or massive). 


5. Constrained systems 


Constraints not only frequently appear in nonrelativistic physics, but are a general 
feature of relativistic particles, so we now give a brief description of how they are 
incorporated into actions. Consider a general action, with constraints, in Hamiltonian 


form: 


S= / dt(—q""pm +H), H = Hgila,p) + 'Gi(a,p) 


(For simplicity, we consider all physical variables to be bosonic for this subsection, 
but the method generalizes straightforwardly paying careful attention to signs.) This 
action is a functional of g™, Pm, A’, which are in turn functions of t, where m and 7 run 
over any number of values. We can think of this as describing a nonrelativistic particle 
with coordinates g and momenta p in terms of time t, but the form is general enough to 
apply to relativistic theories. The gp term tells us p is canonically conjugate to q; the 
rest of the action gives the Hamiltonian, usually quadratic in momenta. The variables 
\' are “Lagrange multipliers”, whose variation in the action implies the constraints 
G; = 0. We then can interpret H,; as the usual (“gauge invariant”) Hamiltonian. We 
also require that the transformations generated by the constraints close, and that the 


Hamiltonian be invariant: 
Gas G,| = —ifis*Gr, [Gi, By = 0 


(More generally, we can allow [G;, Hyi] = —if//G;.) This says that the constraints 
don’t imply any new constraints that we might have missed, and that the “energy” 
represented by H,; is invariant under these transformations. In general, not all con- 
straints commute with the Hamiltonian, and thus those constraints are not time 
independent; we are considering here just the ones that do. The ones that don’t, 
including their Lagrange multipliers, are implicitly included in the gauge invariant 
Hamiltonian. (Thus, the time-dependent constraints must commute with the time- 


independent ones.) 
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We then find that the action is invariant under the canonical transformations 


6(q,p) =i1C'Gi,(¢p)) = og" =¢ Daa OPm = Cogn 
ee aes, = i(6N)G; — iC'G; + [G;, CGY] 
Nat) oN ae Tt ee eile 


=> ON = C+ COM fag! 
(with 6(d/dt) defined as in subsection IA1), where 0/Ot acts on the “explicit” ¢ 


dependence (that in everything except q and p): For general expressions, the total 


time derivative and total variation are given by commutators as 


cA = cA +i[H, A], 6A=6)A+ i[C’G, A] 
where 69 acts on everything except gq and p. The action then varies under these 
transformations as the integral of a total derivative, which vanishes under appropriate 
boundary conditions: 
5Sy = / at “[-(6a")Pm + 0G] =0 

The simplest example is the case with one constraint, which is linear in the 
variables: If the constraint is p, the gauge transformation is dq = ¢, so we gauge 
q = 0 and use the constraint p = 0. In general, this means that for every degree of 
freedom we can gauge away, the conjugate variable can be fixed by the constraint. 
Thus, for each constraint we eliminate 3 variables: the variable fixed by the constraint, 
its conjugate, and the Lagrange multiplier that enforced the constraint, which has 
no conjugate. (In the Lagrangian form of the action the conjugate may not appear 
explicitly, so only 2 variables are eliminated.) As an example of a constraint that does 
not generate a gauge invariance, consider a nonrelativistic particle constrained to a 
sphere by G = (x*)?—1: We can change to spherical coordinates, apply the constraint 
to eliminate the radial coordinate, and then eliminate the radial component of the 
momentum as an auxiliary variable (not appearing with time derivatives), leaving 
an unconstrained theory in terms of angles and their conjugates. In most cases in 
field theory a similar procedure can be applied, eliminating both gauge and auxiliary 


variables: The result is called a “unitary gauge”. 
Exercise ITIA5.1 
Let’s look closer at this example: 


a Perform quantization of a nonrelativistic particle on a sphere (G = (z')? — 1 
for i = 1, 2,3), reducing to an action in terms of just the angles 0 and ¢, and 


their conjugates. 
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b Repeat this procedure using instead the gauge invariant Hamiltonian H,; = 
(x')?(p’)?/2m and the time-independent constraint G’ = x'p;, and compare. 


Show the relation of what’s left of the Hamiltonian to the angular momentum. 


The standard example of a relativistic constrained system is in field theory — 
electromagnetism. Its action can be written in “first-order (in derivatives) formalism” 


by introducing an auxiliary field Gyo: 
F?  F?— G? = F? -(G— F)? = 2GF -G 


where in the first step we added a trivial term for G and in the second step made 
a trivial redefinition of G, so elimination of G by its algebraic equation of motion 
returns the original Lagrangian. The Hamiltonian form comes from eliminating only 


G; by its field equation, since only Fo; contains time derivatives: 
2GF — G = (F,;)? = 4G iF i + 2(Go;)? 
= —4A;Go; 3 [2(Goi)? or (F;;)7| — 4A0;G'o; 


which we recognize as the three generic terms for the action in Hamiltonian form, 
with Go; as the canonical momenta for A;, and Ag as the Lagrange multiplier. The 


constraint is Gauss’ law, and it generates the usual gauge transformations. 


Thus 4’ are also gauge fields for the gauge (time-dependent) transformations ¢'(t). 
They allow construction of the gauge-covariant time derivative 
: d ' 
V= O; + ir'G;, di =V+ Dlg > 6oV = alC"Ga, Vv 
It is convenient to transform the gauge fields away using these gauge transformations, 
so H = H,;. However, with the usual boundary conditions i aoe dt \' is gauge invariant 
under the linearized transformations, so the most we could expect is to gauge * to 


constants. More precisely, the group element 


im lex (-i i; : dt (6) 


is gauge invariant, where “J” is time ordering, meaning we write the exponential of 
the integral as the product of exponentials of infinitesimal integrals, and order them 
with respect to time, later time intervals going to the left of earlier ones. (We treat 
G; quantum mechanically or use Poisson brackets when combining the exponentials. ) 
This is the quantum mechanical version of the time development resulting from the 
corresponding term in the classical action. It is also the phase factor coming from 


the infinite limit of the covariant time translation 


t 
ekV®) — 7 lex (-: | dt! eyes) e Bee 
t—k 
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-kV() ag the product of 


as seen from reordering the time derivatives when writing e 
exponentials of infinitesimal exponents. This allows us to write the explicit gauge 


transformation 


t 
e AW => lex (-: | dt! x(G)] -_ ge Uv eum. At =t— to 
t 


0 
> Vit) =e4OV (te 4 = ce AO%V (t)e(40% — V(to) = O + id"(to) Gi 


(where we define 0; to vary t while keeping t — to fixed). Thus, we can gauge X to 
its value at a fixed time ty. Another way to see this is that varying ’ in the action 
at a fixed time gives G; = 0 at that time, but the remaining field equations imply 
G,; = 0, so G; = 0 always, and 4’ is redundant at other times. This means that if we 


carelessly impose \’ = 0 at all times, we must also impose G; = 0 at some fixed time. 


Note that this special gauge transformation itself has a very simple gauge trans- 


formation: Transforming the A in A by an arbitrary finite transformation ¢'(¢), 


coi) = GiGi gt) gi (bo) Gi 

consistent with the transformation law of V’(t) above. Thus, applying the trans- 
formation A to any gauge-dependent quantity @ gives a gauge-independent quantity 
@'(¢, X), which is invariant under the local transformations ¢(t) and transforms only 
under the “global” transformations ¢(t)). Thus, fixing the gauge A(t) = 0 is equivalent 


to working with gauge-invariant quantities. 


Fixing an invariance of the action is not unique to gauge invariances: Global 
invariances also need to be fixed, although the procedure is so trivial we seldom 
discuss it. For example, even in nonrelativistic systems Galilean invariance needs to 
be fixed: When analyzing a specific problem, we often choose some object to be at 
rest (velocity transformations), choose another to be oriented or moving in a specific 
direction (rotations), and choose a specific event to happen at the origin of space and 
time (translations). Alternatively, we can work with Galilean invariants, just as in 
gauge theories we can work with gauge invariants; however, in practice, for explicit 
calculations (as opposed to discussing general properties), it is more convenient to fix 


the invariance, as this allows simplification of the equations. 
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The simplest relativistic actions are those for the mechanics (as opposed to field 
theory) of particles. These also give the simplest examples of gauge invariance in rela- 
tivistic theories. Later we will find that various properties of the quantum mechanics 


of these actions help to explain some features of quantum field theory. 


1. Free 


For nonrelativistic mechanics, the fact that the energy is expressed as a function of 
the three-momentum is conjugate to the fact that the spatial coordinates are expressed 
as functions of the time coordinate. In the relativistic generalization, all the spacetime 
coordinates are expressed as functions of a parameter 7: All the points that a particle 
occupies in spacetime form a curve, or “worldline”, and we can parametrize this curve 
in an arbitrary way. Such parameters generally can be useful to describe curves: A 
circle is better described by x(@), y(@) than y(x) (avoiding ambiguities in square roots), 
and a cycloid can be described explicitly only this way. 


The action for a free, spinless particle then can be written in relativistic Hamil- 


tonian form as 
Su = f drt" pm + v4(p? +m?) 


where v is a Lagrange multiplier enforcing the constraint p? + m? = 0. This ac- 
tion is very similar to nonrelativistic ones, but instead of x'(t), p;(t) we now have 


({P) 
. 


xz™(T), Pm(T), v(T) (where now means d/dr). The gauge invariance generated by 
p? +m? is 


6st = Cp, op= 0, dv=¢ 


Exercise ITIB1.1 


Consider the action 
S = f ar {EB —'p,)— vfle(a)E* — ('? — mi} 


describing propagation of a particle in a medium with a “dielectric constant” 
e(x). Using its equations of motion, 

a Show that the “group velocity” dE/dp’ is just the usual velocity dz*/dt. (This 
agrees with the usual interpretation of group velocity as the velocity of infor- 


mation.) 
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b Show that the components of the “wave velocity” p'/E are conserved, for 
time-independent ¢, in directions in which € doesn’t change. If € is a func- 
tion of only one spatial dimension (as in the usual light refraction problems), 
these conservation laws, together with the energy-momentum relation, allow 
all components of the wave velocity (and thus the group velocity) to be de- 


termined from initial values. 
c Show that for a massive particle neither of these is the same as the “phase 
velocity” vu’ = d2'/dt, defined by 
0 = d(phase) = (d2™")pm ~ v'p; — E 
even in empty space. (Since this is only 1 equation for 3 unknowns, it is really 
more of a “phase speed”.) Examine phase velocity in the rest frame. 
A more recognizable form of this invariance can be obtained by noting that any 


action $(@*) has invariances of the form 


oS 
A AB AB BA 
op ——as 508? = 


which have no physical significance, since they vanish by the equations of motion. In 


this case we can add 
dx=e(é—vup), dp=ep, dv=0 
and set ¢ = ve to get 
dx=et, dp=ep, dv= (ev) 


We then can recognize this as a (infinitesimal) coordinate transformation for T: 
g(r) S=ar), prior), Aree) Hera: T =T-—€(T) 


The transformation laws for x and p identify them as “scalars” with respect to these 
“one-dimensional” (worldline) coordinate transformations (but they are vectors with 
respect to D-dimensional spacetime). On the other hand, v transforms as a “density”: 
The “volume element” dt v of the world line transforms as a scalar. This gives us 
a way to measure length on the worldline in a way independent of the choice of 7 


parametrization. Because of this geometric interpretation, we are led to constrain 
v>0 


so that any segment of the worldline will have positive length. Because of this re- 


striction, v is not a Lagrange multiplier in the usual sense. This has significant 
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physical consequences: p? + m? is treated neither as a constraint nor as the Hamil- 
tonian. While in nonrelativistic theories the Schrédinger equation is (EF — H)w = 0 
and G;w = 0 is imposed on the initial states, in relativistic theories (p? + m)w = 0 
is the Schrodinger equation: This is more like Hy = 0, since p? already contains the 


necessary E’ dependence. 


The Lagrangian form of the free particle action follows from eliminating p by its 


equation of motion vp = &: 
Sn fe 3(vm? — v*é”) 
For m # 0, we can also eliminate v by its equation of motion v~74? + m? = 0: 
S=m farv=B=m | Vda =m | ds =ms 


The action then has the purely geometrical interpretation as the proper time; how- 
ever, this last form of the action is awkward to use because of the square root, and 
doesn’t apply to the massless case. Note that the v equation implies ds = m(dr v), 
relating the “intrinsic” length of the worldline (as measured with the worldline vol- 
ume element) to its “extrinsic” length (as measured by the spacetime metric). As a 
consequence, in the massive case we also have the usual relation between momentum 
and “velocity” 


ae mon 
— ds 


(Note that p® is the energy, not po.) 
Exercise IIIB1.2 
Take the nonrelativistic limit of the Poincaré algebra: 

a Insert the speed of light c in appropriate places for the structure constants of 
the Poincaré group (guided by dimensional analysis) and take the limit c — 0 
to find the algebra of the Galilean group. 

b Do the same for the representation of the Poincaré group generators in terms 
of coordinates and momenta. In particular, take the limit of the Lorentz 
boosts to find the Galilean boosts. 

c Take the nonrelativistic limit of the spinless particle action, in the form ms. 
(Note that, while the relativistic action is positive, the nonrelativistic one is 
negative. ) 

Exercise ITIB1.3 


Consider the following action for a particle with additional fermionic variables 
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y and additional fermionic constraint ¥ - p: 
Sx = / dr(—4 Pm — 319" Ym + gup" + try - P) 


where 4 is also anticommuting so that each term in the action is bosonic. 


a Find the algebra of the constraints, and the transformations they generate on 


the variables appearing in the action. 
b Show that the “Dirac equation” y - p|W) = 0 implies p?|W) = 0. 
c Find the Lagrangian form of the action as usual by eliminating p by its equa- 


tion of motion. (Note A? = 0.) 


Exercise IIIB1.4 
Consider a “supercoordinate” X™ that is a function of both a fermionic vari- 


able ¢ and the usual r: 
X™(7,C) = a™(r) + 107"(7) 


where the Taylor expansion in ¢ terminates because ¢? = 0. Identify x with 

the usual x, and y with its fermionic partner introduced in the previous 

problem. In analogy to the way y-p was the square root of the 7-translation 

generator SP”, we can define a square root of 0/07 by the “covariant fermionic 

derivative” 3 5 5 
i= ac t Sa, = DP = iz 

We also want to generalize v in the same way as x, to make the action inde- 


pendent of coordinate choice for both 7 and ¢. This suggests defining 
Ba=w'+i0A 
and the gauge invariant action 
s, = [eae 5E(D?X™) DXi 


Integrate this action over ¢, and show this agrees with the action of the 


previous problem after suitable redefinitions (including the normalization of 
J dc). 


The (D+2)-dimensional (conformal) representation of the massless particle (sub- 


section IA6) can be derived from the action 


sa f dr (P+) 
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where 4 is a Lagrange multiplier. This action is gauge invariant under 
dy =ey— dey, SA=eA+ 2A +4 4E 


If we vary \ to eliminate it and y~ as in subsection IA6, the action becomes 


a= - far se7 ah 
3 


which agrees with the previous result, identifying v = e~-, which also guarantees 
o> 0: 


Exercise IIIB1.5 
Find the Hamiltonian form of the action for y: The constraints are now y’, 
r?, and y-r, in terms of the conjugate r to y (see exercise IA6.2). Find 
the gauge transformations in the standard way (see subsection HIA5). Show 
how the above Lagrangian form can be obtained from it, including the gauge 


transformations. 


Using instead the corresponding twistor (subsection IIB6) to satisfy y? = 0, the 


massless, spinless particle now has a single term for its mechanics action: 
s= fe Fe apcp ene 2F2” g 


Unlike all other relativistic mechanics actions, all variables have been unified into just 


z, without the introduction of square roots. 


Exercise IJIB1.6 
Expressing z in terms of \,” and 2“ as in subsection IIB6, show this action 


reduces to the previous one. 


2. Gauges 


Rather than use the equation of motion to eliminate v it’s more convenient to use 
a gauge choice: The gauge v = 1 is called “affine parametrization” of the worldline. 
Note that the gauge transformation of v, 6v = ¢, has no dependence on the coordi- 
nates x and momenta p, so that choosing the gauge v = 1 avoids any extraneous x 
or p dependence that could arise from the gauge fixing. (The appearance of such de- 
pendence will be discussed in later chapters.) Since T = f dr v, the intrinsic length, 
is gauge invariant, that part of v still remains when the length is finite, but it can be 
incorporated into the limits of integration: The gauge v = 1 is maintained by ¢ = 0, 
and this constant ¢ can be used to gauge one limit of integration to zero, completely 


fixing the gauge (i.e., the choice of 7). We then integrate lie where T' > 0 (since 
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originally v > 0), and T is a variable to vary in the action. The gauge-fixed action is 
then 


T 
Snap = f dr[—2 Dm + U(p? + m2) 
0 


In the massive case, we can instead choose the gauge v = 1/m; then the equations 
of motion imply that 7 is the proper time. The Hamiltonian p?/2m + constant then 


resembles the nonrelativistic one. 
Another useful gauge is the “lightcone gauge” 


at 


T= pt 

which, unlike the Poincaré covariant gauge v = 1, fixes T completely; since the gauge 
variation 6(a+/pt) = ¢, we must set ¢ = 0 to maintain the gauge. Also, the gauge 
transformation is again x and p independent. In lightcone gauges we always assume 
p* #0, since we often divide by it. This is usually not too dangerous an assumption, 


since we can treat pt = 0 as a limiting case (in D>2). 


We saw from our study of constrained systems that, for every degree of freedom we 
can gauge away, the conjugate variable can be fixed by the constraint that generates 
that gauge invariance: In the case where the constraint is p, the gauge transformation 
is dq = A, so we gauge g = 0 and use the constraint p = 0. In lightcone gauges the 
constraints are almost linear: The gauge condition is x* = ptr and the constraint is 
p =..., 80 the Lagrange multiplier v is varied to determine p~. On the other hand, 
varying p gives 


7 S vel 


so this gauge is a special case of the gauge v = 1. An important point is that we used 
only “auxiliary” equations of motion: those not involving time derivatives. (A slight 
trick involves the factor of pt: This is a constant by the equations of motion, so we 
can ignore p* terms. However, technically we should not use that equation of motion; 
instead, we can redefine x~ — a2 +..., which will generate terms to cancel any p* 


terms.) The net result of gauge fixing and the auxiliary equation on the action is 


Susc= [arle-p* — a'p' + $00" + m2) 
where x* = (x*, x~, x’), etc. In particular, since we have fixed one more gauge degree 
of freedom (corresponding to constant ¢), we have also eliminated one more constraint 
variable (J, the constant part of v). This is one of the main advantages of lightcone 


gauges: They are “unitary”, eliminating all unphysical degrees of freedom. 
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Exercise IIIB2.1 
Another obvious gauge is T = x°, which works as well as the lightcone gauge 
as far as eliminating worldline coordinate invariance is concerned. (The same 


is true for 7 = n-a for any constant vector n.) 


a Consider the auxiliary equations of motion: Apply this gauge condition; then 
p° appears without time derivatives, so eliminate it and v by their equations 


of motion. Show this gauge is consistent only for p® > 0. 


b The resulting square root is awkward except in the nonrelativistic limit: Take 


it, and compare with the usual nonrelativistic mechanics. 


c A better type of gauge is T = n- a/n-p, what we actually used for the 
lightcone. Compare the value of v that results from the field equations in this 
case to that of the case T = n-x. Discuss the consistency of this case in terms 


of the allowed signs of n- x and n-p vs. those of 7 and v. 


We have seen that the lightcone gauge is a special case of the covariant (affine) 
gauge, where more components are eliminated (a unitary gauge). In other textbooks, 
gauge fixing to a unitary gauge is always performed in two steps, by first going to a 
covariant gauge, and then using the “residual” gauge invariance to completely fix the 
gauge. (This has been done for particles, strings, gauge theories, and even general 
relativity.) When this procedure is explicitly performed, the result can be seen to be 


a lightcone gauge. Clearly it is easier to perform all the gauge fixing in one step. 


3. Coupling 


One way to introduce external fields into the mechanics action is by considering 


the most general Lagrangian quadratic in 7 derivatives: 
fo [atl $e Gm (082 + Am(x)£™ + v¢(x)] 


In the free case we have constant fields Gmn = %mn, Am = 0, and @ = sm? The v 
dependence has been assigned consistent with worldline coordinate invariance. The 
curved-space metric tensor gmp describes gravity, the D-vector potential A,, describes 
electromagnetism, and @ is a scalar field that can be used to introduce mass by 


interaction. 


Exercise ITIB3.1 
Use the method of the problem IIIB1.4 to write the nonrelativistic action 
for a spinning particle in terms of a 3-vector (or (D—1)-vector) X‘(7,¢) and 


the fermionic derivative D. Find the coupling to a magnetic field, in terms 
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of the 3-vector potential A;(X). Integrate the Lagrangian over ¢. Show 
that the quantum mechanical square of w’[p; + A;(x)] is proportional to the 


Hamiltonian. 


Exercise ITIB3.2 


Derive the relativistic Lorentz force law 
8-(u En) + Fant” = 0 


by varying the Lagrangian form of the action for the relativistic particle, in 
an external electromagnetic field (but flat metric and ¢ = $m?), with respect 


to x. 


This action also has very simple transformation properties under D-dimensional 


gauge transformations on the external fields: 
09mn = © OpGmn+ GrimOnye, Am = O,Am+ApOme —OmdA, J¢= Oo 


where we have integrated the action ["! dr and set x(7;) = x, 2(Tf) = zy. These 


transformations have a very natural interpretation in the quantum theory, where 
[re e = (x5|x;) 
Then the A transformation of A is canceled by the U(1) (phase) transformation 
U(x) = PO 4(a) 


in the inner product 


(sl) = : dex sd; (Wls) (arpl,) (ails) = / der gears ap (aeg) (aes) be( 4) 


while the € transformation associated with gmn is canceled by the D-dimensional 


coordinate transformation 


w(x) = p(x + €) 
4. Conservation 


There are two types of conservation laws generally found in physics: In mechanics 
we usually have global conservation laws, of the form Q = 0, associated with a 


symmetry of the Hamiltonian H generated by a conserved quantity Q: 


0 = 4H = iQ, H]=-@ 
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On the other hand, in field theory we have local conservation laws, since the action 
for a field is written as an integral f d?x of a Lagrangian density that depends only 
on fields at x, and a finite number of their derivatives. The local conservation law 
implies a global one, since 
dx d qty : 
Cf =) = 0 ae ee | ee ee 
| oar a | ten "= 9 


where we have integrated over a volume whose boundaries in space are at infin- 


ity (where J vanishes), and whose boundaries in time are infinitesimally separated. 


Equivalently, the global symmetry is a special case of the local one. 


A simple way to derive the local conservation laws is by coupling gauge fields: We 
couple the electromagnetic field A,, to arbitrary charged matter fields ¢ and demand 
gauge invariance of the matter part of the action, the matter-free part of the action 
being separately invariant. We then have 

0= 6Sy = uw (64) 5 - (50) 
using just the definition of the functional derivative 6/5. Applying the matter field 
equations 65,,/d¢ = 0, integration by parts, and the gauge transformation 6A,, = 
—O,A, we find 
a oe 
Similar remarks apply to gravity, but only if we evaluate the “current” , in this case the 
energy-momentum tensor, in flat space gmn = mn, Since gravity is self-interacting. 


We then find 
53a 


O9mn | grin — 
Imn=Nmn 
where the normalization factor of —2 will be found later for consistency with the 


pmn _ _9 


, Of) 


particle. In this case the corresponding “charge” is the D-momentum: 


pr = ae Tom 
(27) P?2 
In particular we see that the condition for the energy in any region of space to be 


nonnegative is 
700 > 0 


Exercise IIIB4.1 
Show that the local conservation of the energy-momentum tensor allows def- 
inition of a conserved angular momentum 


mn ants mmn|0 
r= | son eT 
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Note this result (local conservation of energy-momentum inplies local conser- 


vation of angular momentum) is the same as that of exercise [A4.3. 


To apply this to the action for the particle in external fields, we must first dis- 
tinguish the particle coordinates X(T) from coordinates x for all of spacetime: The 
particle exists only at « = X(r) for some 7, but the fields exist at all x. In this 


notation we can write the mechanics action as 
Sp = Jw = 9mn(2) fe 6(a — X)hu-1X™X" 
+Am(x) fe 6(a — X)X™ + o(z) fer O(a — x)e] 
using { dx 6(a — X(r)) = 1. We then have 


J™(x) = / dr 5(@ —X)X™ 


de fe OX ie 


Note that T°? > 0 (since v > 0). Integrating to find the charge and momentum: 
Q= / dri =X x= / dx exo? = 2") Sem") 


pe = fe OS a peux e(X°)5(a° — X°)y 1 X™ = e(p®)p™ 


where we have used p = v~!X (for the free particle), where p is the momentum 
conjugate to X, not to be confused with P. The factor of e(p®) (e(u) = u/|u| is the 


sign of u) comes from the Jacobian from changing integration variables from Tt to X°. 


The result is that our naive expectations for the momentum and charge of the 
particle can differ from the correct result by a sign. In particular p®, which semi- 
classically is identified with the angular frequency of the corresponding wave, can 
be either positive or negative, while the true energy P° = |p°| is always positive, as 
physically required. (Otherwise all states could decay into lower-energy ones: There 
would be no lowest-energy state, the “vacuum”.) When p° is negative, the charge Q 
and dX°/dr are also negative. In the massive case, we also have dX°/ds negative. 
This means that as the proper time s increases, X° decreases. Since the proper time 
is the time as measured in the rest frame of the particle, this means that the particle 
is traveling backward in time: Its clock changes in the direction opposite to that of 
the coordinate system x”. Particles traveling backward in time are called “antiparti- 
cles”, and have charges opposite to their corresponding particles. They have positive 


true energy, but the “energy” p° conjugate to the time is negative. 
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Exercise IIIB4.2 
Compare these expressions for the current and energy-momentum tensor to 
those from the semiclassical expansion in exercise II[A4.1. (Include the in- 


verse metric to define the square of —0,,S + qAm there.) 


5. Pair creation 


Free particles travel in straight lines. Nonrelativistically, external fields can alter 
the motion of a particle to the extent of changing the signs of spatial components of 
the momentum. Relativistically, we might then expect that interactions could also 
change the sign of the energy, or at least the canonical energy p?. As an extreme case, 
consider a worldline that is a closed loop: We can pick 7 as an angular coordinate 
around the loop. As 7 increases, X° will either increase or decrease. For example, a 
circle in the x°-a! plane will be viewed by the particle as repeating its history after 
some finite 7, moving forward with respect to time x° until reaching a latest time t,, 
and then backward until some earliest time t;. On the other hand, from the point of 
view of an observer at rest with respect to the x” coordinate system, there are no 
particles until 2° = t;, at which time both a particle and an antiparticle appear at 


the same position in space, move away from each other, and then come back together 


4 


and disappear. This process is known as “pair creation and annihilation” . 


\ bf 


at 


Whether such a process can actually occur is determined by solving the equations 
of motion. A simple example is a particle in the presence of only a static electric 
field, produced by the time component A° of the potential. We consider the case of a 
piecewise constant potential, vanishing outside a certain region and constant inside. 
Then the electric field vanishes except at the boundaries, so the particle travels in 
straight lines except at the boundaries. For simplicity we reduce the problem to two 
dimensions: 

AP = =F jor 0= gt = L, 0 otherwise 


for some constant V. The action is, in Hamiltonian form, 


oo : dr {=2" pp + v}[(p + A)? + m?]} 
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and the equations of motion are 


Dm = —U(p+A)"OmAn => Pi=E 


(pt AP =—-m? => p=+/(E+ A)? — m? 
¢ f=ptA: => ate. ears 


where EF is a constant (the canonical energy at x! = oo) and the equation p! = ... 
is redundant because of gauge invariance. We assume FE > 0, so initially we have a 


particle and not an antiparticle. 


We look only at the cases where the worldline begins at x° = 21 = —oo (lower 


left) and continues toward the right till it reaches 2° = 21 = +oo (upper right), so 
that p! = v-'¢! > 0 everywhere (no reflection). However, the worldline might bend 
backward in time (¢° < 0) inside the potential: To the outside viewer, this looks 
like pair creation at the right edge before the first particle reaches the left edge; the 
antiparticle then annihilates the original particle when it reaches the left edge, while 
the new particle continues on to the right. From the particle’s point of view, it has 
simply traveled backward in time so that it exits the right of the potential before it 
enters the left, but it is the same particle that travels out the right as came in the 


left. The velocity of the particle outside and inside the potential is 


Ee =e 
dal a outside 
: 
dx (E—-VP—m | - 
~——__~_____ inside 
E-V 


From the sign of the velocity we then see that we have normal transmission (no 


antiparticles) for E >m+V and E > m, and pair creation/annihilation when 
Veme>hom => V>2n 


The true “kinetic” energy of the antiparticle (which appears only inside the potential) 
is then —(E —V) > m. 
Exercise ITIB5.1 


This solution might seem to violate causality. However, in mechanics as well 
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as field theory, causality is related to boundary conditions at infinite times. 
Describe another solution to the equations of motion that would be inter- 
preted by an outside observer as pair creation without any initial particles: 
What happens ultimately to the particle and antiparticle? What are the al- 
lowed values of their kinetic energies (maximum and minimum)? Since many 
such pairs can be created by the potential alone, it can be accidental (and not 
acausal) that an external particle meets up with such an antiparticle. Note 
that the generator of the potential, to maintain its value, continuously loses 


energy (and charge) by emitting these particles. 
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eine eae umnemeccvaeskas C. YANG-MILLS .......0.000...ce 


The concept of a “covariant derivative” allows the straightforward generalization 
of electromagnetism to a self-interacting theory, once U(1) has been generalized to a 


nonabelian group. Yang-Mills theory is an essential part of the Standard Model. 


1. Nonabelian 


The group U(1) of electromagnetism is Abelian: Group elements commute, which 
makes group multiplication equivalent to multiplication of real numbers, or addition 
if we write U = e’©. The linearity of this addition is directly related to the linearity 
of the field equations for electromagnetism without matter. On the other hand, the 
nonlinearity of nonabelian groups causes the corresponding particles to interact with 
themselves: Photons are neutral, but “gluons” have charge and “gravitons” have 


weight. 


In coupling electromagnetism to the particle, the relation of the canonical mo- 
mentum to the velocity is modified: Classically, the covariant momentum is dx/drt = 
p+ 4A for a particle of charge q (e.g., ¢g = 1 for the proton). Quantum mechanically, 


the net effect is that the wave equation is modified by the replacement 
O—-V=0+iqA 


which accounts for all dependence on A (“minimal coupling”). This “covariant deriva- 
tive” has a fundamental role in the formulation of gauge theories, including gravity. 
Its main purpose is to preserve gauge invariance of the action that gives the wave 
equation, which would otherwise be spoiled by derivatives acting on the coordinate- 


dependent gauge parameters: In electromagnetism, 
p=eiMp, A’'=A-OX => (Vo) =e(VY) 
or more simply 
ye = PAV ei 

(More generally, g is some Hermitian matrix when ~ is a reducible representation of 
U(1).) 

Yang-Mills theory then can be obtained as a straightforward generalization of elec- 
tromagnetism, the only difference being that the gauge transformation, and therefore 


the covariant derivative, now depends on the generators of some nonabelian group. 


We begin with the hermitian generators 


Gi, G;| = —ifi;*Ge, Git = G; 
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and exponentiate linear combinations of them to obtain the unitary group elements 


g=e* A=XNG;; N= = glha=g! 


We then can define representations of the group (see subsection IB1) 
paey, yl = ple, (Gib)a = (Gia? bp 


For compact groups charge is quantized: For example, for SU(2) the spin (or, for 
internal symmetry, “isospin” ) is integral or half-integral. On the other hand, with 
Abelian groups the charge can take continuous values: For example, in principle the 
proton might decay into a particle of charge a and another of charge 1 — 7. The 
experimental fact that charge is quantized suggests already semiclassically that all 


interactions should be descibed by (semi)simple groups. 


If \ is coordinate dependent (a local, or “gauge” transformation), the ordinary 


partial derivative spoils gauge covariance, so we introduce the covariant derivative 
Va a On + iAa, Ag = ASC; 


Thus, the covariant derivative acts on matter in a way similar to the infinitesimal 


gauge transformation, 
dpa = iN'Gia dp, Vala = Ona + iAd'Gia’vp 

Gauge covariance is preserved by demanding it have a covariant transformation law 
V=e*Ve® => 6A=-[V,A]=-OA-7A,A] 


The gauge covariance of the field strength follows from defining it in a manifestly 


covariant way: 
as Vol = 1F ap > ee = ere. Fup = F °C; = Ora Ad} + i[Aa, Ab] 
= f= OfaAny' = Aq Ay* fix! 


The Jacobi identity for the covariant derivative is the Bianchi identity for the field 
strength: 
0= [V a; [Vo, Vall = 1[Vias Foal 


(If we choose instead to use antihermitian generators, all the explicit i’s go away; 
however, with hermitian generators the 2’s will cancel with those from the derivatives 
when we Fourier transform for purposes of quantization.) Since the adjoint represen- 


tation can be treated as either matrices or vectors (see subsection IB2), the covariant 


206 II. LOCAL 


derivative on it can be written as either a commutator or multiplication: For example, 


we may write either [V, F] or VF’, depending on the context. 


Actions then can be constructed in a manifestly covariant way: For matter, we 
take a Lagrangian Lyyo(0,~) that is invariant under global (constant) group trans- 


formations, and couple to Yang-Mills as 


Iuo(0,¥) — Lua = Lu o(V,Y) 


(This is the analog of minimal coupling in electrodynamics.) The representation we 
use for G; in V, = 0, +71A!G; is determined by how w represents the group. (For 
an Abelian group factor U(1), G is just the charge gq, in multiples of the g for that 


factor.) For example, the Lagrangian for a massless scalar is simply 
Lo = 3(V°6)'(Vad) 


(normalized for a complex representation). 


For the part of the action describing Yang-Mills itself we take (in analogy to the 

U(1) case) 
La(Ay) = geF nae Seu 

where 7;; is the Cartan metric (see subsection IB2). This way of writing the action 
is independent of our choice of normalization of the structure constants, and so gives 
one unambiguous definition for the normalization of the coupling constant g. (It is 
invariant under any simultaneous redefinition of the fields and the generators that 
leaves the covariant derivative invariant.) Generally, for simple groups we can choose 


to (ortho)normalize the generators G; with the condition (see subsection IB2) 
Mig = CA08; 


for some constant c4; for groups that are products of simple groups (semisimple), 
we might choose different normalization factors (but, of course, also different g’s) for 
each simple group. For Abelian groups (U(1) factors) 7; = 0, but then the gauge 
field has no self-interactions, so the normalization of the coupling constant is defined 


only by matter terms in the action, and we can replace 7; with 6;; in the above. 


Usually it will prove more convenient to use matrix notation: Choosing some 


convenient representation R of G; (not necessarily the adjoint), we write 
La(At) = grtrr FY Fo 
IR 


The normalization of the trace is determined by R, and thus so is the normalization 


convention for the coupling constant; a change in the representation used in the action 
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can also be absorbed by a redefinition of the coupling. For example, comparing the 


defining and adjoint representations of SU(N) (see subsection IB2), 
La = getrp FYFy = tra FPF > 9 = 2N Gh 
ID 894 


In general, we specify our normalization of the structure constants by fixing cr for 
some FR, and our normalization of the coupling constant by specifying the choice of 
representation used in the trace (or use explicit adjoint indices). As a rule, we find 


the most convenient choices of normalization are 
cp = 1, g9= 49D 


(see subsection IB5). 


Exercise IJIC1.1 
Write the action for SU(N) Yang-Mills coupled to a massless (2-component) 
spinor in the defining representation. Make all (internal and Lorentz) indices 
explicit (no “tr”, etc.), and use defining (N-component) indices on the Yang- 
Mills field. 


We have chosen a normalization where the Yang-Mills coupling constant g appears 
only as an overall factor multiplying the F? term (and similarly for the electromagnetic 
coupling, as discussed in previous chapters). An alternative is to rescale A > gA and 
F = gF everywhere; then V = 0+igA and F = 0A+ig[A, A], and the F? term has 
no extra factor. This allows the Yang-Mills coupling to be treated similarly to other 
couplings, which are usually not written multiplying kinetic terms (unless analogies to 
Yang-Mills are being drawn), since (almost) only for Yang-Mills is there a nonlinear 
symmetry relating kinetic and interaction terms. 

Current conservation works a bit differently in the nonabelian case: Applying 
the same argument as in subsection IIJB4, but taking into account the modified 
(infinitesimal) gauge transformation law, we find 

5S 
= 7 
Since 0,,J™ # 0, there is no corresponding covariant conserved charge. 
Exercise IIIC1.2 
Let’s look at the field equations: 


J” Vad =0 


a Using properties of the trace, show the entire covariant derivative can be 


integrated by parts as 


fu tr(A[V, B]) = - f ade tr({V, A]B), pu wtVx = - f de (Ve)ty 
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for matrices A,B and column vectors w, vy. 


b Show 
OF ap = Viad Ag 


c Using the definition of the current as for electromagnetism (subsection IIIB4), 


derive the field equations with arbitrary matter, 
LAV" Ria = Jo 
d Show that gauge invariance of the action S,4 implies 
Yivna=i 


Also show this is true directly, using the Jacobi identity, but not the field 


equations. (Hint: Write the covariant derivatives as commutators.) 


Exercise ITIC1.3 
Expand the left-hand side of the field equation (given in excercise IIIC1.2c) 
in the field, as 
LAV'R,, = 410g — ja 


where 7 contains the quadratic and higher-order terms. Show the noncovari- 
ant current 


i Jy ae 


is conserved. The 7 term can be considered the gluon contribution to the 
current: Unlike photons, gluons are charged. Although the current is gauge 
dependent, and thus physically meaningless, the corresponding charge can 
be gauge independent under situations where the boundary conditions are 


suitable. 


2. Lightcone 


Since gauge parameters are always of the same form as the gauge field, but with 
one less vector index, an obvious type of gauge choice (at least from the point of view 
of counting components) is to require the gauge field to vanish when one vector index 


is fixed to a certain value. Explicitly, in terms of the covariant derivative we set 
eV =S peAH]o 


for some constant vector n*. We then can distinguish three types of “axial gauges”: 
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(1) “Arnowitt-Fickler”, or spacelike (n? > 0), 
(2) “lightcone”, or lightlike (n? = 0), and 
(3) “temporal”, or timelike (n? < 0). 
By appropriate choice of reference frame, and with the usual notation, we can write 
these gauge conditions as V! = 61, Vt = 0, and V° = 0°. 

One way to apply this gauge in the action is to keep the same set of fields, but 
have explicit n dependence. A much simpler choice is to use a gauge choice such as 


Ay = 0 simply to eliminate Ag explicitly from the action. For example, for Yang-Mills 


we find 
A=0 > h=A = 2(Fiy)? = —1(A;)? + 2(Fi)? 
where “ '” here refers to the time derivative. Canonical quantization is simple in 


this gauge, because we have the canonical time-derivative term. However, the gauge 
condition can’t be imposed everywhere, as seen for the corresponding gauge for the 
one-dimensional metric in subsection IIIB2, and in our general discussion in subsec- 
tion IITA5: Here we can generalize the time-ordered integral for the temporal gauge 


to an integral path-ordered with respect to a straight-line path in the n direction: 


e kn V (2) = gee ee e tA(@,.e—kn) =p leap (-: dx’ - tw) 
a—kn 


Applying this gauge transformation to n- V, as in subsection IIIA5, fixes n- A toa 


constant with respect to n- 0; the effect on all of V is: 
V' (x) = eee (je ee) S V(r 4 kn) _ er Oy (nie Ve 


For example, for the temporal gauge, if we choose “x” to be on the initial hypersurface 


x° = tg, then we can choose k = t — to so that V’ is evaluated at arbitrary time t: 
Vile) = ew vVialaie rr ™)| 4) |e 


By Taylor expanding in k, this gives an explicit expression for A, at all times in terms 
of A,, and Fy and its covariant time-derivatives, evaluated at some initial time, but 
with simply Ao(t, x") = Ao(to, x’). Thus, we still need to impose the Apo field equation 


[V;, A;] = 0 as a constraint at some initial time. 


Exercise IJIC2.1 
First set Ag = 0, then derive the field equations for A; from the Yang-Mills 
action. Compare the results of exercise IIIC1.2c for J = 0. Show explicitly 
that these field equations imply the time derivative of the constraint [V;, Ai] = 
0. (Hint: Write everything in terms of F’’s and V’s till the end.) 
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In the case of the lightcone gauge we can carry this analysis one step further. 
In subsection IIB3 we saw that lightcone formalisms are described by massless fields 
with (D—2)-dimensional (“transverse”) indices. In the present analysis, gauge fixing 


alone gives us, again for the example of pure Yang-Mills, 


A= = Ftteaatat, Ft =OtA, FCsSD A’ -[V',A] 


= i(Fe)? = L(t a-y? — Lat aiy(a- At — [Vi Ao]) + FY)? 


1 
8 
In the lightcone formalism O~ (—0,) is to be treated as a time derivative, while 
O* can be freely inverted (i.e., modes propagate to infinity in the x* direction, but 
boundary conditions set them to vanish in the x direction). Thus, we can treat A7 
as an auxiliary field. The solution to its field equation is 

A 


1 a t 
= salVi, ot) 


which can be substituted directly into the action: 


salV!, 0+ Al 


1 
at 
[A’, 0*.A) 


(FO) = pAOtO Al + 3(FY) — GIV', Ot Al 


L 
8 
= — SA'DA' + is [A’, AOA? + i5(0'A’) 

1 
Ore 


[A’, 07 A’ 


[A’, AY)’ + g[A’, OF 4") 


il 
8 

We can save a couple of steps in this derivation by noting that elimination of any 
auxiliary field, appearing quadratically (as in going from Hamiltonian to Lagrangian 


formalisms), has the effect 
L= saa” +bx+ce—- —4ax"| 91 /ax=0 + Leo 
In this case, the quadratic term is (F~*)?, and we have 


a(R)? =, (Fs)? = shtR 7 s(Ft~)? ome a(FY)? _ 3(0* A’) (07 A’) 6 s(Ft~)? 


Oo] 


where the last term is evaluated at 
a = i i = Lei i 
0=[V.FM|=-OFOHIVEFY = Pha Siviry 


1 
oF 


> L=1(F%)?+laiate Ai — LV! ot A] [V!, ot Al] 


as above. 


In this case, canonical quantization is even simpler, since interpreting O~ as the 


time derivative makes the action look like that for a nonrelativistic field theory, with 
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a kinetic term linear in time derivatives (as well as interactions without them). The 
free part of the field equation is also simpler, since the kinetic operator is now just 
O1. (This is true in general in lightcone formalisms from the analysis of free theories 
in chapter XII.) In general, lightcone gauges are the simplest for analyzing physical 
degrees of freedom (within perturbation theory), since the maximum number of de- 
grees of freedom is eliminated, and thus kinetic operators look like those of scalars. 
On the other hand, interaction terms are more complicated because of the nonlo- 
cal Coulomb-like terms involving 1/0*: The inverse of a derivative is an integral. 
(However, in practice we often work in momentum space, where 1/p* is local, but 
Fourier transformation itself introduces multiple integrals.) This makes lightcone 
gauges useful for discussing unitarity (they are “unitary gauges”), but inconvenient 
for explicit calculations. However, in subsection VIB6 we’ll find a slight modification 
of the lightcone that makes it the most convenient method for certain calculations. 
(In the literature, “lightcone gauge” is sometimes used to refer to an axial gauge 
where A* is set to vanish but A™~ is not eliminated, and D-vector notation is still 
used, so unitarity is not manifest. Here we always eliminate both components and 


explicitly use (D — 2)-vectors, which has distinct technical advantages. ) 


Although spin 1/2 has no gauge invariance, the second step of the lightcone 
formalism, eliminating auxiliary fields, can also be applied there: For example, for a 
massless spinor in D=4, identifying 9°? = A- as the lightcone “time” derivative, we 
vary W° (or W®) as the auxiliary field: 


iL = POM Ye + POM YF — PIM YL® — POM 


ee 


Oo Ob 8 
=  Y a % 
1 
—» sO 
_ 7b 2 ® 
= ee aoe” 


This tells us that a 4D massless spinor, like a 4D massless vector (or a complex scalar) 
has only 1 complex (2 real) degree of freedom, describing a particle of helicity +1/2 
and its antiparticle of helicity —1/2 (+1 for the vector, 0 for the scalar), in agreement 
with our general discussion of helicity in subsection IIB7. On the other hand, in the 
massive case we can always go to a rest frame, so the analysis is in terms of spin 
(SU(2) for D=4) rather than helicity. For a massive Wey] spinor we can perform the 
same analysis as above, with the modifications 

Lo 1+ Boye give) = 1-920) 


® 
5 p 
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where we have dropped some terms that vanish upon using integration by parts and 
the antisymmetry of the fermions. So now we have the two states of an SU(2) spinor, 
but these are identified with their antiparticles. This differs from the vector: While 
for the spinor we have 2 states of a given energy for both the massless and massive 
cases, for a vector we have 2 for the massless but 3 for the massive, since for SU(2) 


spin s has 2s+1 states: 


4D states of given EF: 


Exercise IITIC2.2 
Show that integration by parts for 1/0 gives just a sign change, just as for 0. 


In general dimensions, massless particles are representations of the “little group” 
SO(D—2) (the helicity SO(2) in D=4), as described in subsection IIB3. Massive 
particles represent the little group SO(D—1), corresponding to dimensional reduction 


from an extra dimension, as described in subsection ITB4. 


3. Plane waves 


The simplest nontrivial solutions to nonabelian field equations are the general- 
izations of the plane wave solutions of the free theory. We begin with general, free, 
massless theories, as analyzed in subsection IIB3. In the lightcone frame only pt is 
nonvanishing. In position space this means the field strength depends only on x~. 
This describes a wave traveling at the speed of light in the positive x! direction, with 
no other spatial dependence (i.e., a plane wave). We allow arbitrary dependence on 
x, corresponding to a superposition of waves with parallel momenta (but different 
values of pt). While its dependence on only 2~ solves the Klein-Gordon equation, 
Maxwell’s equations are solved by giving the field strength as many upper + indices 


as possible, and no upper —’s. 


Generalizing to interactions, we notice that the Yang-Mills field equations and 
Bianchi identities differ from Maxwell’s equations only by the covariantization of the 
derivatives (at least for pure Yang-Mills). Because Maxwell’s equations were satisfied 
by just restricting the index structure, we can do the same for the covariant derivatives 
by assuming that only V* is novanishing on the field strengths. In other words, we 
can solve the field equations and Bianchi identities by choosing the only nontrivial 


components of the gauge fields to be those in VT. 


C. YANG-MILLS 213 


The final step is to solve the relation between covariant derivative and field 
strength. This is simple because the index structure we found implies the only non- 


trvial commutators are 


In particular, this implies that the gauge fields have no x* dependence, and only a 


very simple dependence on x’. We find directly 
age) 


where F"* (a7) is unrestricted (other than the explicit index structure and coordinate 
dependence). Of course, this result can also be used in the free theory, although it 
differs from the usual lightcone gauge. 
Exercise IJIC3.1 
Gauge transform this solution to the lightcone gauge At = 0 in the Abelian 
case. 


Exercise ITIC3.2 


Translate the above results into spinor notation in D=4. 


4. Self-duality 


The simplest and most important solutions to the field equations are those that 


are invariant under the “duality” symmetry that relates electric and magnetic charge: 
[Va Va] = £3€abcalV%, V4] 
Applying the self-duality condition twice, we find 
Feabe feet = +65f,55) 


which requires an even number of time dimensions. For example, since the action 
is usually Wick rotated anyway for perturbative purposes, we might assume that 
we should do the same for classical solutions that are not considered as “small” 
fluctations about the usual vacuum. (Such a Euclidean definition of field theory 
has been considered for a mathematically rigorous formalism, called “constructive 
quantum field theory”, since the Gaussian path integrals for scalars and vectors are 
then well-defined and convergent. However, other spins, such as for fermions or 


gravity, are a problem in this approach.) Alternatively, we can replace €,5-q with 
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1€qbeq and complexify our fields. The self-duality condition, when combined with the 


Bianchi identities, implies the field equations: For Yang-Mills, 


Viele = Sete Vere Vee a Ve 


Since the self-duality condition is only first-order in derivatives, it’s easier to solve 


than the usual field equations. 


Plane wave solutions provide a simple example of self-duality, since the field 
strengths can easily be written as the sum of self-dual and anti-self-dual parts: In 
Minkowski space we define the self-dual part as helicity +1 ( fx3)> and anti-self-dual 
as —1 (fag). For example, for a wave traveling in the “1” direction, the Ft? =iF*? 
components give the two self-dualities for Yang-Mills, describing helicities +1 (the 


two circular polarizations). 


Exercise IIIC4.1 
Generalize the results of the previous subsection to more general waves, with 
an A* which is a general function of x~ and 2’ (with the other components 
of A still vanishing). 


a Find the field strength, and show it satisfies the interacting field equations if 


A” satisfies the free Laplace equation 
(0')?At =0 
b In D=4, the solution to this equation is 
At = f(z ,2")+ f(a, 2") 


Show this decomposition describes the two separate helicities. 


Before further analyzing solutions to the self-duality condition, we consider ac- 
tions that use self-dual fields directly. This will allow us to describe not only theories 
whose only solutions are self-dual, but also more standard theories as perturbations 
about self-duality, and even massive theories. The most unusual feature of this ap- 
proach is that complex fields are used without their complex conjugates, since this 
is implied in D=3+1 by self-duality. (Alternatively, we can Wick rotate to 2+2 di- 
mensions, where all Lorentz representations are real.) There are two stages to this 
approach: (1) Use a first-order formalism where the auxiliary field is self-dual. The 
usual first-order actions for spin 1/2 (Weyl or Dirac) already can be interpreted in 
this way, where “self-duality” means “chirality”. (2) For the massive theory, elimi- 


nate the non-self-dual field (as an auxiliary field, as allowed by the mass term), so 
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that the dynamics is described by the self-dual field, which was formerly considered 


as auxiliary. The massless theory then can be treated as a limiting case. 


The simplest (and perhaps most useful) example is massive spin 1/2 coupled in 


a real representation to Yang-Mills fields: 


L= PAV aha + sem ba + oda) 


“T”) refers to the Yang-Mills group index (with respect. to 


where the transposition ( 
which the spinors are column vectors). Note that ~ must be a real representation of 
this group (A? = —A) for the mass term to be gauge invariant (unless the mass term 
includes scalars: see the following chapter). Even though w and w are complex conju- 
gates, they can be treated independently as far as field equations are concerned, since 
they are just different linear combinations of their real and imaginary parts. (Com- 
plex conjugation can be treated as just a symmetry, related to unitarity.) Noticing 
that the quadratic term for 7 has no derivatives, we can treat it as an auxiliary field, 
and integrate it out (i.e., eliminate it by its equation of motion, which gives an explicit 


local expression for it): 
Le 32 (O— mya + ahi fe 
where we have used the identity 
Val V8s = 4{V a, VPs} + 4[V 07, Vs] = —4680 — ify? 


whose simplicity followed from w being a real representation of the Yang-Mills group. 
(Of course, we could have eliminated w instead, but not both.) For convenience we 


also scale w by a constant 
p> rir 
to find the final result 


LL 7h — m*)a — 3 fa’ bp 


Now the massless limit can be taken easily. This action resembles that of a scalar, plus 
a “magnetic-moment coupling”, which couples the “(anti-)self-dual” (chiral) spinor 
Wa to only the (anti-)self-dual part fog of the Yang-Mills field strength. 


For the same reason, the kinetic operator can be written in terms of just the 


self-dual part Syg of the spin operator: 


L= ty" (O- m? —if?? Spa) 
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This operator is of the same form found by squaring the Dirac operator: 
29? = —2y- VP = (177° + DV aVe = 0 iF Spa 


except for the self-duality. The simple form of this result again depends on the 
reality (parity invariance) of the Yang-Mills representation; although this squaring 
trick can be applied for complex representations (parity violating), the coupling does 
not simplify. This is related to the fact that real representations are required for our 


derivation of the self-dual form. 


In the special case where the real representation is the direct sum of a complex 
one w+. with its complex conjugate w_, (as for quarks in the Standard Model, or 


electrons in electrodynamics), we can rewrite the Lagrangian as 
Le = —3y4°(O — m*) ha — bp fa? b_¢ 


The method can also be generalized to the case of scalar couplings, but the action 


becomes nonpolynomial. 


For spin 1, we start with the massless case. We can write the Lagrangian for 
Yang-Mills as 
LS ir(G™ fag — 39° G2) 
where Gag is a (anti-)self-dual auxiliary field. Although this action is complex, elim- 
inating G by its algebraic field equation gives the usual Yang-Mills action up to a 
total derivative term (€F,,, Fa), which can be dropped for purposes of perturbation 


theory. For g = 0, this is an action where G acts as a Lagrange multiplier, enforcing 
the self-duality of the Yang-Mills field strength. 


If we simply add a mass term 
i. _ iGo ea 


then A can be eliminated by its field equation, giving a nonpolynomial action of the 
form 
L+ Lm — —3(0G)[(%)? + G] (0G) — 59°C? 
Just as the spin-1/2 action contained only a 2-component spinor describing the 2 
polarizations of spin 1/2, this action contains only the 3-component Gg, describing 
the 3 polarizations of (massive) spin 1. 
Exercise IIIC4.2 


Find the Abelian part of this action. Show the free field equation is 
(O — m?)Gog = 0 (without gauge fixing). 
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5. Twistors 


In four dimensions with an even number of time dimensions, the “Lorentz” group 
factorizes (into SU(2)? for D=4+0 and SL(2)? for D=2+2). This makes self-duality 
especially simple in spinor notation: For Yang-Mills (cf. electromagnetism in subsec- 
tion IIA7), 

[Vor ve] aicorsee (f° = 0) 


where we have written primes instead of dots to emphasize that the two kinds of 
indices transform independently (instead of as complex conjugates, as in D=3+1). For 
purposes of analyzing self-duality within perturbation theory, we can use a lightcone 
method that breaks only one of the two SL(2)’s (or SU(2)’s), by separating out its 


indices into the @ and © components: 


/ 


Ves Ve 0 ve ao 


where we have chosen a lightcone gauge: The vanishing of all field strengths for the 


covariant derivative V®’ says that it is pure gauge (as seen by ignoring all but the 


/ . 
x°* coordinates). We now solve 


[Ver Vor =aO a YO = 98% 4708" 6 


i.e., VO — 0° has vanishing curl, and is therefore a gradient. We therefore have 


/ 


AOV = 0, APT SOP Ge FP a 9h OP? 6 


These can also be written in terms of an arbitrary constant twistor e* (= 62 above) 
as 
AW = GF" (—ie%e,6), po So" Oo Geo) 


The final self-duality condition [V°°, V°"'] = 0 then gives the equation of motion 


1O1¢ + (°°) (0% a) = 0 


Exercise ITIC5.1 
Show that the sign convention for Wick rotation of the Levi-Civita tensor 


consistent with the above equations is 
d 
Le = 5€abea lt ; Foo! BB! = Cap tap! => 


Eaa! BB y7'55" = Cape ysCars' Cary — CasCayCarpr Cys 
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Exercise IIIC5.2 
Look at the action G°’ fg for self-dual Yang-Mills in the lightcone gauge, 
using the results above. Show that this action is equivalent to the lightcone 
action for ordinary Yang-Mills (subsection IIIC2), with some terms in the 


interaction dropped. 


At least for 4D Yang-Mills, advantage can be taken of the conformal invariance 
of the classical interacting theory by using a formalism where this invariance is man- 
ifest. We saw in subsection IA6 that classical mechanics could be made manifestly 
conformal by use of extra coordinates. Covariant derivatives can be defined in terms 
eee 


of projective lightcone coordinates, but the twistor coordinates z see subsections 


IIB6 and IIIB1) are more useful. The self-dual covariant derivatives then satisfy 


[V.Aa, Vee] = t1Cas fas 


in direct analogy to 4D spinor notation. This equation also can be solved by the 
lightcone method used above, but now this method breaks only the internal SL(2) 
symmetry, leaving SL(4) conformal symmetry manifest. More general self-dual field 
strengths in this twistor space are also of the form f,4..2, totally symmetric in the 


indices. We also need to impose the constraint on the field strength 
aaa = 0 


(and similarly for the more general case) to restrict the range of indices to the usual 
4D spinor indices (in which the field strengths are totally symmetric). Self-duality 
implies the Bianchi identity 

Vial eic = 9 


which also generalizes to the other field strengths, and is the equivalent of the usual 
first-order differential equations (Dirac, Maxwell, etc.) satisfied by 4D field strengths. 
As usual, it in turn implies the interacting Klein-Gordon equation, which in the 


Yang-Mills case is 
3V A Velateo = —iL feta, fai) 


The Bianchi together with the z index constraint imply the constraint on coordinate 
dependence 
(24°V as + 55) fac = 0 


which eliminates dependence on all but the usual 4D coordinates. These four equa- 
tions are generically satisfied by self-dual field strengths. The self-duality itself of the 


field strengths is a consequence of their total symmetry in their indices, and the fact 
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that they are all lower (SL(4)) indices. (The z index constraint then reduces them to 
SL(2) Weyl indices all of the same chirality.) 


Exercise ITIC5.3 


Derive the last three equations from the previous two (self-duality and zf=0). 


Exercise ITIC5.4 
Show that non-self-dual Yang-Mills is conformally invariant in D=4 by extend- 
ing the (4+2)-dimensional formalism of subsections IA6 and IIIB1 (especially 
exercise IIIB1.5): Show the field strength 


Fapc = —i5yalV a, Vel] 
satisfies the gauge covariances 
5Aa = —[Va,A] — yar 
and Bianchi identities 
y[AF’ecp) = V af Bcp) = 0 
The duality transformation 
F'apc > @€ABCDEFF oe 
then suggests the field equations 
y4Fapc = V“4Fazc = 0 
in addition to the usual constraint 
y’Fapc = 0 


By reducing to D=4 coordinates with the aid of the above yF conditions, show 
F reduces to the usual field strength, and the remaining equations reduce to 


the usual gauge transformation, Bianchi identity, duality, and field equation. 
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6. Instantons 


Another interesting class of self-dual solutions to Yang-Mills theory are “instan- 
tons”, so called because the field strength is maximum at points in spacetime, unlike 
the plane waves, whose wavefronts propagate from and toward timelike infinity. A 
particular subset of these can be expressed in a very simple form by the ’t Hooft 
ansatz in terms of a scalar field: In twistor notation, choosing the Yang-Mills gauge 
group GL(2) (in 2+2 dimensions, or SU(2)®GL(1) for 4+0), 


tA pu” = —060AIn 6b => tAdja’ = —Oraln b 


so the GL(1) piece is pure gauge, and has been included just for convenience. Note 
that this ansatz ties the SL(2) twistor index with the SL(2) gauge group indices (1, «), 
but in this notation the index that carries the spacetime (conformal) symmetry is free. 
Imposing the self-duality condition on the field strength, and separating out the terms 


symmetric and antisymmetric in AB, we find 
ifan.” = —5360(4"0p).b 
604° Opad = 0 


The “field equation” for ¢ is just the twistor version of the (free) Klein-Gordon 
equation, and its solution is the projective lightcone version of 4D point sources (see 


subsection IAG): Since for any two 6D lightlike vectors y and y/’ 
y=e(z,1,427) => yy = —Jee(r—2')? 


we have the solution a 
+ 


=v), yr =0 
i=1 


with y given in terms of z as before, and y; are constant null vectors. “k” is the number 
of instantons. (The one term for k = 0 is pure gauge.) The usual singularities in 
the Klein-Gordon equation at y = y; are killed by the extra factor of ¢~! in the field 


equation. 


Exercise IIIC6.1 
Let’s check the Klein-Gordon equation for y # y; directly in twistor space. 
We will need the identity 


y~=0 > Yiapyicd) = 0 [ y) 


a 
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in the product 
YY = 3Y yew 
Prove this identity in two ways: 


a Show it follows from the definition 


2_ 1,ABCD 
Yi = GE YiABYiCD 


b Show it follows from plugging in the solution to the lightlike condition, 
YiAB = Zi A°ZiBa 
c Now use the identity to show the above solution satisfies its field equation by 
evaluating the z derivatives. 
We can rewrite this in the usual 4D coordinates by transforming from 24% to \°% 
and x” as 24, = Aq” (54, 2H") (see subsection IIB6): 
de" A ja,” = dal” Ayu,” + [OA gle Aga”® = da Agi® = 10 “a, 


where in the first step we have used the expression for z in terms of A and x, and in 


the second we used the result that 
(24%A49 + 58) = 0 


We now recognize that the gauge transformation that gets rid of all but the “x 
components” of A (whose existence is guaranteed by the condition 24 f 4, = 0) uses 


A itself as the gauge parameter: 
dzA° A ja® = —iX7 dN + A (dz Alga) Mn 


The net result is that A can be reduced to an ordinary 4-dimensional expression by 
just setting \ = 0 in the original expression. Then 

Ayn = —OOyln do, = aaa ap 

(x — 2;)? 

with y in terms of x and a scale factor (worldline metric) e as in subsections IA6 

and IIIB1 (and dropping an overall factor that doesn’t contribute to A). Note that, 

unlike the expression in twistor space, where conformal invariance is manifest, here 


Lorentz invariance is tied to the Yang-Mills symmetry. 


Exercise IIIC6.2 
Show in 4D coordinates that the gauge-invariant quantity tr(f?) is finite at 
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the points x = x;, where A is singular. (This means that the gauge choice is 
singular, not physical quantities.) 
Another important property of instantons is that they give finite contributions to 


the action. In vector notation, we have 


d+zx d+x 
pe = 5c Fg — ee aatr f 7 Fe Fy _ ieee med et 


(27)? 


The last expression can be reduced to a boundary term, since 
gtr Pap l ed — 7 {a Boed| 
in terms of the “Chern-Simons form” 


Babe = tr(SAOvAgq + iz AjaApAgq) 


Exercise IITIC6.3 
Although the Chern-Simons form is not manifestly invariant, its variation is, 


up to a total derivative: 


a Show that its general variation is 


OP ses = 5 tr[(0 Aja) Fog => OiaArd Ag] 


b Show the gauge transformation of B is 


0 Babe = —50|aXvd; ab = str (ADiaAy) 


If we assume boundary conditions such that F’ drops off rapidly at infinity, then 
A must drop off to pure gauge at infinity: 


iAm > 8 Omg 


Since instantons always deal with an SU(2) subgroup of the gauge group, we’ll assume 
now for simplicity that the whole group is itself SU(2). Then the action can be given 
a group theory interpretation directly, since the integral over the surface at infinity 
is an integral over the 3-sphere, which covers the group space of SO(3), and thus half 
the group space of SU(2). Explicitly, 


_ 1 3 1_mnpq 
S= fa Om rag Brain 


872g? 


— sce f Com $e™tr(g10,8)(8"9pB)(8" 28) 
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= stp f tr fe tr(@18)(619)8)(e "Ahw) 

where in the last step we have switched to coordinates for the 3-sphere, using the fact 
J d*x €”"?" finnpq is independent of coordinate choice. In fact, in the case where g is 
a one-to-one map between the 3-sphere and the group SO(3), this last expression is 
just the definition of the invariant volume of the SO(3) group space. In that case, the 
integral gives just the volume of the 3-sphere (277). In general, the map g will cover 
the SU(2) group space an integer number gq of times, and thus cover the SO(3) group 
space 2q times, so the result will be 

_ lal 

29? 

where we have used the fact that self-dual solutions have g > 0 while anti-self-dual 
have q < 0. 


(Anti-)self-dual solutions give relative minima of the action with respect to more 


general field configurations: 


OA te / 5( Fup + F€abceak)? = tr / (F? + he¥ 1B, Fug) 


q is an integer, and thus can’t be changed by continuous variations: It is a topological 
property of finite-action configurations. Thus the self-dual solutions give absolute 
minima for a given topology. (All these solutions will be given implicitly by twistor 
construction in the following subsection. Note that our normalization for the structure 
constants of SU(2) differs from the usual, since we use effectively tr(G;G;) = 6;; 
instead of the more common tr(G;Gj) = $6;j;, which would normalize the structure 
constants as in SO(3): fij, = ij. The net effect is that our g° contains a relative 
extra factor of 1/2, in addition to the effective extra factors coming from our different 


normalization of the action.) 


Exercise IIIC6.4 
Explicitly evaluate the integral for the instanton number q for the solutions 
of the ’t Hooft ansatz. Show that the asymptotic form can be expressed in 
= x. (det g £1 because of the GL(1) piece.) Note that 


there are boundary contributions not only at x = oo but also around the 


K 


terms of (g)) 


singular points x = x;, which are of the same form but opposite sign. (Since 


the singular parts of A are pure gauge, they cancel in F’.) 


At the quantum level, instantons are important mostly because they are an ex- 
ample of fields that don’t fall off rapidly at infinity, and thus contribute to [ «FF. 
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However, once the restriction on boundary conditions is relaxed, there can be many 
such field configurations. The instantons are then distinguished by the fact that they 
are the minimal action solutions for a given topology; this makes them important for 


describing low-energy behavior. 


7. ADHM 


Much more general solutions of this form can be constructed using twistor meth- 
ods. (In fact, they can be shown to be the most general self-dual solutions that fall 
off fast enough at infinity in all directions.) The first step of the Atiyah-Drinfel’d- 
Hitchin-Manin (ADHM) construction is to introduce a scalar square matrix in a larger 
group space 

U" = (uy', via) (I = (0, ia)) 


The index a is the usual two-valued twistor index, for SU(2) in Euclidean space or 
SL(2) in 2+2 dimensions. The other indices are 


SO(N+4k) GL(2k) 


SU(N+2k) (SL(N+2k)) | GL(k,C) 
USp(2N+2k) (Sp(2N+2k)) | GL(k) 


The index v is for the defining representation of the Yang-Mills group H, which is 
any of the compact classical groups for Euclidean space, but is its real Wick rotation 
for 2+2 dimensions. The index J is for the defining representation of the group G, 
a larger version of H, where k is the instanton number. Finally, the index 7 is for a 


general linear group. We also have the matrix 
UF = (ul, u va) 


For the SO and (U)Sp cases both U matrices are real, for the SU case they are complex 
conjugates of each other, and for the SL case they are real and independent. We next 
relate the two U’s by 


i K K na I L I 
U ty =0,, Uti =? rer =), C evng= Cooge 


so they are almost inverses of each other, except that the “metric” g is not constrained 


to be a Kronecker 6. We then write the gauge field as a generalization of pure gauge: 


TA Ae = uw! Orqtr" 
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(This is similar to the method used for nonlinear ¢ models of coset spaces G/H as 


discussed in subsection IVA3 below, except for g.) 


Self-duality then follows from requiring a certain coordinate dependence of the 


U’s: This is fixed by giving the explicit dependence of the v’s as 
Ulia = brake, U ita = Diaz a 


where the 6’s are constants. The orthonormality conditions on the U’s then implies 
the constraint on the b’s 
bn abrisy = 0 


as well as determining the u’s in terms of the b’s (with much messier dependence than 
the v’s), and thus A. Note that the z dependence of u can be written in terms of just 


x, as follows from rewriting the wv orthogonality as (after multiplying by z) 
w bray” = ur'D pay” = 0 


and noting scale invariance. Then the x components of A can also be written in 
terms of just x. We then can check the self-duality condition by calculating f: The 


orthonormality condition on the U’s can be written as 
bf = uu, = vpieg? V" ite 
where g‘’ is the inverse of g;. Then schematically we have 
iF =d0iA+iAiA 
= (Ou) (Ou) — (Ou) uti(du) 
= (Ou)vgi(du) 
= u(Ov)g(Ov)u 
= ubgbu 
or more explicitly 
tfapi” = —(ul brica)g” (usb vp) 
where self-duality is Faces = Casfas. We can also directly show 24° fag = 0. 


Exercise ITTC7.1 
Solve the bb constraint for k=1 and H=SU(2), and compare to the 1-instanton 


solution of subsection IIIC6. 
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8. Monopoles 


Instantons are essentially 0-dimensional objects, localized near a point in 4- 
dimensional spacetime (or many points for multi-instanton solutions). Another type 
of solution is 1-dimensional; this represents a particle (with a 1D worldline). Unlike 
the plane-wave solutions, which represent the massless particles already described 
explicitly by fields in the action, we now look for time-independent solutions, which 


describe massive (since they have a rest frame), bound-state particles. 


Looking at time-independent solutions is similar to the dimensional reduction 
that we considered in subsection IIB4 to introduce masses into free theories, only 
(1) this mass vanishes, and (2) we reduce the time dimension, not a spatial one. In 
our case, the dimensional reduction of a 4-vector (the Yang-Mills potential) gives a 
3-vector and a scalar, both in the adjoint representation of the group. Let’s consider 
the reduction in Euclidean space, so the scalar kinetic term comes out with the right 
sign. Then the 4D Yang-Mills action reduces as 


ee eal er 


where we have labeled the scalar Ag = ¢ and by dimensional reduction 09 — 0. Note 
that this is the same action that would have been obtained by starting out with Yang- 
Mills coupled to an adjoint scalar in four dimensions, either Minkowski or Euclidean, 
and choosing the gauge Ag = 0. Thus, time-independent solutions to Euclidean 
Yang-Mills theory are also time-independent solutions to Minkowskian Yang-Mills 
coupled to an adjoint scalar (although not the most general, since the gauge Ap = 0 
is not generally possible globally, especially when we assume time independence of 
even gauge-dependent quantities). In particular, this means that time-independent 
solutions to self-dual Yang-Mills are also solutions of Minkowskian Yang-Mills cou- 
pled to an adjoint scalar. This allows us to use the first-order differential equations 
and topological properties of self-dual Yang-Mills theory to find physical bound-state 


particles in this vector-scalar theory. 


Dimensionally reducing the (Euclidean) self-duality condition, we have 
—[Vi, g] = 56ighl ik 


As for instantons, the simplest solutions are for SU(2). As for the ’t Hooft ansatz, 
we look for a solution that is covariant under the combined SU(2) of the gauge group 
and 3D rotations: In SO(3) vector notation for both kinds of indices (using the SO(3) 


normalization of the structure constants [iG;,iGj] = €:;47Gx), 


bi = Xi9(r), (Ai); = €ijrtrA(r) 
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(We know to use an ¢€ tensor in A because of covariance under parity.) The self- 
duality equation then reduces to two nonlinear first-order differential equations (the 


coefficients of 6;; and x;x,/r?): 
-p=—rAp=2A+rA, -—ro'+rAge==-rA +77? 


After some massaging, we find the change of variables 


~=++ry, A=i+rA 
leads to the simplification 
g=-4, A=-AZ 
~ then can be eliminated, giving an equation for A. Making a final change of variables, 
=A = w-WP=-1 
we can guess the solution (with regularity at r = 0) 


: 1 k 1 
w=k'sinh(kr) > A= 2 (— — 1) . pS valkr coth(kr) — 1] 


Exercise IJIC8.1 
Repeat this calculation in spinor notation: 
a In Euclidean space we can choose of ~ Cagr. Show that we can then write 
the 4-vector potential for the monopole as 
i(Aaa)”? = dag? Ay(r) + 65277 A_(r) 
which is symmetric in neither a@ nor yd. (Compare the ’t Hooft ansatz in 
subsection IIIC6.) However, 7°’ is now symmetric from dropping 2°. 


b Impose self-duality, where 


Bape”? = 567,08) = 6255 — 3CopC™ 


& 


from subtracting out the Opx° piece. Derive the resulting equations for Ax, 


and show they agree with the above for 


As =-3(A+ 9) 


In general, the Lagrangian of a Euclidean theory is the Hamiltonian of the 


Minkowskian theory (with the sign conventions we introduced in subsection HIA1), 
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since Wick rotation changes the sign of the kinetic energy and not the potential en- 
ergy. In our case, this means the Minkowskian energy of the Yang-Mills + adjoint 
scalar theory can be evaluated in terms of the same topological expression we used for 
instantons: From the previous subsection, using S = f{ dt E and dy = 0, we evaluate 


in Euclidean space 
B= rg f Pos CO Bose,  € Boje 4 —eajutr (OF jx) = 2 tr(b[Vi, ol) = & tr(¢’) 


where we have used an integration by parts to simplify B. (Compare exercise 
IIIC6.3a.) Since at spatial infinity 


k 1 
and effectively ¢ d?a; — 4nrx;, we find 
ik 
27g? 


Also by similar arguments to those used for instantons, we see that any solutions with 
boundary conditions A — 0, |¢| — |k| as r — oo have energy at least as great as 


this. There is also a topological interpretation to this energy: Writing it as 
—a ¢ do; exutr(() Fin) 


we see that the energy is proportional to the magnetic flux, i.e., the “magnetic charge” 
of the monopole. (The asymptotic value (¢) of @ picks out a direction in isospace, 
reducing SU(2) to U(1).) As in electromagnetism, magnetic charge is quantized in 
terms of electric charge. However, for compact gauge groups, electric charge is also 
quantized. (For the usual U(1), charges are arbitrary, but for SU(2), any component 
of the isospin is quantized.) The energy is thus quantized in terms of k: It is a 


multiple of the energy we found for the single monopole above. 


Exercise IIIC8.2 
Perform a singular gauge transformation that makes (¢) point in a constant 
(rather than radial) direction in isospin (SU(2)) space. Show that the isospin 
component of the asymptotic form of A describes a U(1) magnetic monopole: 


magnetic flux radiating outward from the origin. 
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IV. MIXED 


In this chapter we consider ways in which gauge symmetry combines with global 
symmetries for new effects. The interplay between global internal symmetries of scalar 
and spinor theories and local symmetries of Yang-Mills is important for understanding 


mass generation for all spins, and is fundamental for the Standard Model. 


anadbhneenisoutes A. HIDDEN SYMMETRY .................. 


Symmetries, especially local ones, are clearly very important in the formulation 
of interactions. However, symmetries are not always apparent in nature: For ex- 
ample, while most symmetries prefer massless particles, of all the observed particles 
the only massless ones are the graviton, photon (probably), and (some) neutrinos. 
Furthermore, of the massive ones, none with different properties have the same mass, 
although some are close (e.g., the proton and neutron). There are three solutions to 


this problem: 


(1) The symmetry is not a property of nature, but only an approximate symmetry. 
Some terms in the action are invariant under the symmetry, but other terms 
violate it. We can treat such “explicit symmetry breaking” by first studying the 
symmetry for the invariant terms, and then treating the breaking terms as a 
perturbation. 


(2) Although the laws of physics are symmetric, nature is an asymmetric solution to 
them. In particular, such a solution is the “vacuum”, or state of lowest energy, 
with respect to which all other states are defined. Since the vacuum is not invari- 
ant under the symmetry, the symmetry transformations take the vacuum to other 
states of the same energy. This case is called “spontaneous symmetry breaking”. 
For example, in electrodynamics an infinite charge distribution of constant den- 
sity is translationally and rotationally invariant, but by Gauss’ law we know there 


must be an electric field, whose direction breaks rotational invariance. 


(3) The particles in terms of which these laws are formulated are not those observed 
in nature. For example, the hydrogen atom is most conveniently described in 
terms of a proton and an electron, but in its low-energy physics only the atom 
itself is observed as a separate entity: The U(1) symmetry related to charge is 
not seen from the neutral atoms. The more extreme case where such particles 


always appear in bound states is known as “confinement” . 


Generally, such broken symmetries are at least partially restored at high energies. 


For example, if the symmetry breaking introduces masses, or mass differences between 
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related particles, then the symmetry may become apparent at energies large with 
respect to those masses. Similarly, a hydrogen atom excited to an energy much larger 


than its lower energy levels will ionize to reveal its constituent particles. 


It often is possible to change to a set of variables that are invariant under a 
local symmetry. (We saw the analog for the global case when considering translation 
invariance in subsection IA1.) For example, if we can define everywhere a variable that 
transforms as 6¢(x”) = A(x), then it can be used to everywhere undo the invariance. 
We can choose the “gauge” A = —@, transforming ¢ to 0 everywhere, leaving no 
residual invariance, or we can work with composite, invariant variables: E.g., w’ = 
we is replaced (invertibly) with ~ = we7?, so yy’ = w. 


1. Spontaneous breakdown 


We first consider symmetry breaking by the vacuum, known as “spontaneous 
breakdown”. The action is invariant under the symmetry, but the vacuum state is not: 
Thus, the symmetry acting on the vacuum produces other zero-energy solutions to 
the field equations, but this symmetry is not apparent when considering perturbation 
about the vacuum. In this case, although the symmetry is broken, there are obvious 
residual effects, particularly if the breaking can be considered as “small” with respect 


to some other effects. 


The “Goldstone theorem” is an important statement about the effect of symmetry 
breakdown: If a continuous global symmetry is spontaneously broken, then there is a 
corresponding massless scalar. The proof is simple: Consider a (relative) minimum of 
the potential, as the vacuum. By definition, we have spontaneous symmetry break- 
ing if this minimum is not invariant under the continuous symmetry: i.e., applying 
infinitesimal symmetry transformations gives a curve of nearby states, which have 
the same energy, because the transformations are a symmetry of the theory. But 
the mass of a scalar, by definition, is given by the quadratic term in its potential, 
i.e., the second derivative of the potential evaluated at the vacuum value. (The first 
derivative vanishes because the vacuum is a minimum.) So, if we look at the scalar 
defined to parametrize this curve of constant energy in field space, its mass vanishes. 


(This field may be a function of the given fields, such as an angle in field space.) 


We can also formulate this more mathematically, for purposes of calculation: 
Consider a theory with potential V(¢'). (The Lagrangian is V plus derivative terms. 
For simplicity we consider just scalars.) The masses of the scalars are defined by 


the quadratic term in the potential, expanding about a minimum, the vacuum. The 
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statement of symmetry of the potential means that 


symmetry 6¢' = C'(¢) > 0=dV=COV forall ¢’ 


where we allow nonlinear symmetries, and 0; = 0/0¢'. Differentiating, and then 


evaluating at this minimum, 
(OV) =0 at minimum ¢ = (¢) 


=> 0= (0;(C'AV)) = ((A;C)(AV)) + (C'0,0)V) = (6°)(0:0;V) 


where here the vacuum value ( ) classically means to just evaluate at ¢ = (). (So 
classically (AB) = (A)(B).) Spontaneous symmetry breaking means the vacuum 
breaks the symmetry: If this symmetry is broken, then (¢*) 4 0, so it is a nontrivial 


eigenvector of (0;0;V) (the mass matrix) with vanishing eigenvalue. So, we can write 
P= (PY+XC) + 


where x is a massless field. 


The simplest example is a single free, massless field, V = 0. Then ¢ is simply a 


constant. The simplest choice of vacuum is just (¢) = 0, which breaks the symmetry: 
2 = 
L = +(0¢)’, d@ = constant, (o) =0 


Then ¢ is a “Goldstone boson”. 


The simplest nontrivial example, and a useful one, is a complex scalar with the 
potential 
V(o) = p"(I9)? — gm?)? 
This is invariant under phase transformations 6¢ = 7¢¢. There is a continuous set of 
minima at |¢| = m/V/2. We choose (¢) = m/,/2; then the Goldstone theorem tells 
us that the imaginary part of @ is the Goldstone field. Explicitly, separating the field 


into its real and imaginary parts, 
G=Fylmt tix) = Va gmp? t amd? $x?) + er"? $27) 


where (w) = (y) = 0. We could also use the nonlinear separation of the field into 
magnitude and phase, ¢ = (m + p)e’/./2: Then @ drops out of the potential, and 
its transformation (p is invariant) is the same as that of the free massless scalar. If 
@ had been real, then only the discete symmetry ¢ < —@ would have been broken, 


and there would be no Goldstone boson. 
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Exercise IVA1.1 


Write the complete action in terms of p and 0. 


Note that this model would naively seem to have a tachyon (state with negative 
(mass)*) if we had expanded about (¢) = 0. However, since the vacuum is defined 
always as a minimum in the potential (or the energy), the true states always have 
nonnegative (mass)”. This is the case for positive spins for similar reasons: We saw in 
subsection IIB4 that free massive theories follow from massless ones by dimensional 
reduction from one extra spatial dimension. If we had used an extra time dimension 
instead, as required for the “wrong sign” for the mass term in p? + m?, there would 
also be wrong signs for Lorentz indices, resulting in kinetic terms with arbitrarily 


negative energy. 


Spontaneous symmetry breaking will also affect the actions for fields other than 
those getting vacuum values, that couple to them. For example, terms of the form 
wf () will tend to generate a mass for ~ if (¢) 4 0 (actually f((¢)) 4 0). Such 
couplings exist for w of spins 0, 5; 1. Since masslessness is generally associated with 
symmetry (chiral symmetry for spin 4 and gauge symmetry for spin 1), this type of 
mass generation implies symmetries other than just those of the scalars are broken 


by this mechanism (see subsections IVA4-6). 


2. Sigma models 


The Goldstone mechanism thus produces massive particles as well as massless 
ones, at least for polynomial potentials, to which we are restricted by quantum con- 
siderations, to be discussed later. We now look for approximations to polynomial 
scalar actions that eliminate the massive fields, but still take them into account 
through their equations of motion, in the limit where their masses tend to infinity. 
For example, in the above simple model, we can take the limit 4 — oo, which takes 
the w mass (Am) to infinity. In this limit, the potential energy can remain finite only 
if it vanishes: |¢|? = $m”. (In quantum language, the potential’s contribution to the 
path integral is just 6({ V) in that limit. Alternatively, we can neglect the kinetic 
energy for |¢| in comparison to the mass or potential, and then eliminate |¢| through 
its equation of motion in this approximation.) We can also enforce this limit directly 


by using a Lagrange multiplier field A: 
L = 300)? + ACI? - dm?) 


The solution to the constraint is @ = wae and the action then describes just a free, 


real scalar 0. 
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A less trivial example is a nonabelian generalization of this example: Consider 
@ as a vector of an internal SO(n) symmetry. (The previous example was the case 
SO(2).) The Lagrangian is then 


L = 7(09)* + 3A(¢* — m*) 


The usual way to solve quadratic constraints without introducing square roots is to 


use the identity 
\(L+ ix)??? = (\L+ia)?)? = (2x)? + (1—2*)? = (142°)? 


This is often used for trigonometric substitutions or simplifying integrals. For exam- 
ple, when an integrand has a V/1 — z?, substituting x = sin @ eliminates the square 
root at the price of requiring trigonometric identities, which in turn are usually solved 
by making a second variable change to y = tan(@/2). On the other hand, the above 
identity suggests making instead the variable change x = 2y/(1+ y?), which actually 
gives the same result, more directly, as the previous two-step method. (This identity 
can also be used for finding integer solutions to the Pythagorean theorem: A right 
triangle with two shorter sides of integer lengths 2mn and m?—n? has the hypotenuse 


m? + n?, where m,n are integers.) 


We then can solve the constraint ¢? = m? with the coordinates for the sphere in 


terms of an SO(n—1) vector x, 


1,2 
~  to9X 

e=m (So. —) 

Then the kinetic term (now the whole action) becomes 


(Ox)? 
+(00)* = im ale 


Exercise IVA2.1 
For SO(3), express x in terms of the usual spherical polar angular coordinates 


6 and y, along with the inverse expressions (@ and y in terms of y). 


Another way to obtain this result is to use the solution of subsection IA6 to the 
constraint 


0=Y/ =P y = gy=e@",1,52°) 


(but now (y*)? is positive definite). Then the desired constraint 


(y*)? +(y*)? =1 
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follows from further constraining 


a0 el 1,2 _ 
l=y =e (1+ 92°) = eT ig 
Qdx? 
= dy=edr? = 
: (+ ap 


yielding the above result for « = x/V/2. We thus have a nonpolynomial action, each 
term having derivatives. The original SO(n) symmetry is nonlinearly realized on the 
“angle” variables y, and the vacuum ((y) = 0) spontaneously breaks the symmetry 
to SO(n—1). The constant m acts as a dimensionful coupling, as seen by scaling 


x — x/m to give the kinetic term the standard normalization. 


A complex generalization of this model is described by the Lagrangian 
L = 3|V¢l’ + A(\9l? — m*) 


where ¢ is now a complex n-component vector, V is a U(1)-covariant derivative 
(V@é = (0 +7A)¢), and A is a Lagrange multiplier enforcing that ¢ has magnitude 
m. This model thus has a U(n) symmetry. Since A has no kinetic term (F?), we can 


eliminate it by its algebraic field equation: 
L — 5/04? + gax(0'09)? + A(|b)? — m’) 


where we have applied the constraint |¢|? = m? (or shifted A to cancel terms propor- 
tional to |¢|? — m?). Since the U(1) gauge was not fixed yet, we still have local U(1) 
invariance even without an explicit gauge field. We can use this invariance to fix the 
phase of one component of ¢, and use the constraint from A to fix its magnitude. In 


terms of the remaining (n—1)-component complex vector y, 


b=m( x ae 
1+ $lxl?? 1+ 4|x/? 


a Ayn lOxP + a0 Ox)? 
= Csr ae)? 


(Alternatively, we can solve the constraint and fix the gauge first, then eliminate 
A by its field equation.) This model is known as the CP(n—1) model (“complex 
projective” ). 

Another example that will prove more relevant to physics is to generalize ¢ to an 


n®n matrix: We then consider the Lagrangian 


L = tr[3(0¢)! - (0¢) + 4°(¢'6 — bm?) 
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(where J is the identity matrix). Since ¢'¢ is hermitian and positive definite, the 


minimum of the potential is at ¢'¢ = sm? , and we can choose 
(6) = 24 
using the SU(n)®SU(n)@U(1) invariance 
¢ =U,dUR * 


(We can include the U(1) in either Ur or Ur.) The vacuum then spontaneously breaks 


this invariance to SU(n): 
($') =(¢) = U_,=Up 
In the large-mass limit, we get the constraint 
go=sm7l > o=3U, UU=I, U)=1, L— pm’tr[(OU)'(OU)] 


so the field U itself is now unitary. 


3. Coset space 


The appearance of the scalar fields (Goldstone bosons) as group elements can 
be generalized directly in terms of the effective theory, without reference to massive 
fields. Such a theory should be considered as a low-energy approximation to some 
unknown theory. Although the unknown theory may be better behaved at high 
energies quantum mechanically (see later), the low-energy effective theory can be 
determined from just (broken) symmetry. We therefore assume a symmetry group G 
that is broken down to a subgroup H by the vacuum. (l.e., the vacuum is invariant 
under the subgroup H, but not the full group G.) We are interested in only the 
Goldstone bosons, associated with all the generators of the group G less those of 
H. These fields are thus coordinates for the “coset space” G/H: They correspond to 
elements of the group G, but elements related by the subgroup H are identified. 


Explicitly, we first write the field g as an element of the group G, either by 
choosing a matrix representation of the group (as in the U(N) example above), or 


explicitly expanding over the group generators Gy: 
g=e*, b=¢'(2)Gr 


We then “factor” out the subgroup H by introducing a gauge invariance for that 


subgroup: 


gi =gh, hae ot 
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in terms of the H generators H,, which are a subset of Gy: 
Gr — (Heil; ) 


where T; are the remaining generators, corresponding to G/H. In particular, we can 


choose 


gauged’ =0 => G=¢'T; 


However, G should still be a global invariance of the theory, though not of the vacuum. 


We therefore assume the global transformation 


g = 909 


where go is an element of the full group G, but is constant in x. The vacuum 


(g)=1 


is then invariant under the global subgroup go = h~', where thus h is constant and 


go EH (i.e., G is spontaneously broken to H). 


g can be used to convert any representation of the global group G into one (but 


usually reducible) of the smaller local group H: 
V=gp >= vegid, V=h ry 


We can apply a similar procedure to find a field strength for g, invariant under the 


global group, as an element of the Lie algebra of G: 
g'Oag = 0, + iALH, +iF'T;, = Va tik’; 


This can be evaluated in the @ parametrization as multiple commutators, as usual: A 
and F are both nonpolynomial functions of @, but with only one derivative. We have 
absorbed A into a covariant derivative V because of the remaining transformation 


law under the local group H: 
(V+iFY =hUV+iF)\h => W=h'Vh, F'=h'FhA 


where we have assumed |H,,7;] ~ T;. (In particular, this is true for compact groups, 
where the structure constants are totally antisymmetric: Then fig; = 0 => frix = 0.) 


Then the action invariant under global and local transformations can be chosen as 


L = jm’tr(F”) 


A. HIDDEN SYMMETRY 239 


For example, the real vector model we gave in the previous subsection describes the 
coset space SO(n)/SO(n—1), the complex vector describes SU(n)/U(n—1), and the 
matrix model describes U(n)®U(n)/U(n). 


Exercise IVA3.1 
Use the coset-space construction to derive the specific o models explicitly 


given in the previous subsection, as just identified. 


a Find the real and complex vectors by dividing up the adjoint representation 


into appropriate blocks. 


b For the case of U(n)@U(n)/U(n), the direct product means we use separate 
group-element fields for the two global groups, with 


G, = 909th, — Ip = Gro9rh 
for the same h. Find an expression for the field U of the previous subsection 


without breaking any global or local symmetries. 


Note that the field redefinition between the G-representation matter field w and 
the H-representation matter field ¢) modifies the form of the couplings. For example, 
the kinetic term for w will have ordinary partial derivatives 0, while that for w will 
have covariant ones V. (One or the other will also have F’ terms.) On the other 
hand, a mass term for q) may turn into a potential/Yukawa term for w, since the 
larger group G might not allow mass terms permitted by the smaller group H. The 
result is that what appears as a nonderivative coupling in terms of ~ may appear as 


a derivative coupling in terms of q. 
We can formulate general spontaneous breakdown in this language: 


(1) Start with a polynomial action with symmetry G, including scalars @ that will 


suffer the breakdown through expectation, and other fields w. 


(2) Introduce an appropriate g and define the new scalar fields dé = q~'¢, as well as 
the new matter fields » = g~!wW. Thus S[¢, vw] Sd, 0,9]. In terms of these 


new fields, the action has a local symmetry H, and G now acts only on g. 


(3) (H-covariantly) constrain ¢ in such a way that g effectively replaces the missing 
parts. Then g describes all the Goldstone bosons, while the reduced ¢ describes 
the other scalars in the original @ (in the previous examples, the massive ones, 
which decoupled at low energies). 

For example, for the SO(n) model, ¢ is an n-vector, while g parametrizes the coset 

SO(n)/SO(n—1), and thus has n(n—1)/2 — (n—1)(n—2)/2 = n—1 non-gauge compo- 

nents — it is an n—1-vector under H. Thus for d, which is n>n—1@1 under H, we 


just constrain the n—1 part to vanish. 


240 IV. MIXED 


We have described how nonpolynomial actions quadratic in derivatives can arise 
as a low-energy approximation to polynomial theories. Further nonpolynomial terms 
quartic in derivatives (but no more than quadratic in time derivatives) can be useful 
for certain applications, but these arise from polynomial actions quadratic in deriva- 
tives (which are preferred for quantum reasons) only by quantum effects. One use is 
in models which describe (pseudo)scalar mesons by fundamental fields (i.e., solutions 
to the free field equations, which yield interacting solutions through perturbation the- 
ory), but baryons by nonperturbative solutions to the field equations of these scalars. 
Such an interpretation is suggested by an expansion in 1/N, where N is the number 
of colors, since a baryon is made of N quarks (whereas a meson contains just one 
quark and one antiquark). Such models are useful for describing static properties 
of baryons (masses, quantum numbers), but the complexity of such solutions to the 
field equations prevents their use for interactions of baryons (especially with other 


baryons). 


4. Chiral symmetry 


Later we’ll examine a description of the strongly interacting particles ( “hadrons” ) 
in which they are all considered as composites (bound states) of fermionic “quarks”. 
However, this theory is extremely difficult to solve, so we first consider treating the 
hadrons as fundamental instead. Since there are probably an infinite number of kinds 
of hadrons (or at least some integer power of 10*°, considering the (Planck mass)?), 
this would require a formulation in terms of a “string” that treated all “mesons” 
(bosonic hadrons) as a single entity. That possibility also will be considered later; 
for now, we look at the simpler possibility of studying just the low-energy physics of 
hadrons by using fields for just the lightest particles. 


So far, the only observed scalar particles have been strongly interacting ones. 
Some of the scalar mesons, especially the “pions”, are not only the lightest hadrons, 
but can be considered close to massless on the hadronic scale. We therefore look for 
a description of pions (and some close relatives) in the massless approximation; then 


mass-generating corrections can be considered. 


Normally, quantum corrections can affect masses. The only way to guarantee 
masslessness at the quantum level is through some symmetry; we then can study this 
symmetry already at the classical level. We have seen that (unbroken) gauge invari- 
ance can require masslessness for all fields except the scalar and spinor. Masslessness 
for a spinor can be enforced by “chiral symmetry”: If there is a U(1) symmetry for 


all irreducible spinors 7, then no mass terms (bilinears ~f'w2_.) can be constructed. 
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(Generally, each spinor can have different U(1) charges, as long as no two charges 
add to zero. Of course, this U(1) can be a subgroup of a larger chiral symmetry 
group.) The only way a scalar can be guaranteed masslessness is if it is a Gold- 
stone boson. We therefore look for a description of pions as Goldstone bosons of 
some spontaneously broken symmetry. (Supersymmetry is another possibility to en- 
force massless scalars, but only if there are also massless fermions, which is not the 
case for hadrons.) Furthermore, pions and the other lightest scalars are actually 
pseudoscalars: This suggests that they are the Goldstone bosons of broken chiral 
symmetry, which simultaneously generates masses for the fermions. 


For simplicity, we consider the coupling of scalar mesons to quarks. We could 
instead couple mesons to “baryons” (fermionic hadrons), thus treating only hadrons, 
but the principles would be the same, only the indices would be messier. Combining 
C invariance with chiral symmetry, and including a meson potential for spontaneous 


symmetry breaking, we can write the action for just the quarks and scalar mesons as 


S= fact 


L= [a'i0" x42 si dp 108° q* ral i [3(0¢)' - (06) + (ob! — gm*I)’] 
+ManOdha + PROG ia] 
where ¢ is an m®m matrix (m “flavors”), gq, and gr are n®m matrices (n “colors” ), 


and A is the “Yukawa coupling”. Sometimes it will be convenient to drop Lorentz 


indices to emphasize internal symmetries: 
L = (q'1idgug + aRidg* nr) + [3(04)' - (89) + 7’(o'd— 3m*1)"] + A(qaR + aRr*o"q'7) 


Besides color symmetry (local if we had bothered to write in the Yang-Mills fields for 
the “gluons”, by 0 — V on the quarks), we have the (global) U(m),;@U(m) chiral 


(flavor) symmetry 
dp=aU1, de= rUr*, ¢ =U_'dUR 
including the (global) U(1) “baryon number” symmetry 
= _ 10 1 _ io r - 40 po 
U,p=Up=e" > Gdi=e'an, In=e dr, GP =O 


If we think of baryon number as an SO(2) symmetry, then charge conjugation is 


just the reflection that completes this to an O(2) symmetry (see exercise IIA1.2): 


C: qed, e>¢ 
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From this, and the usual CP 
CPs GG", Gaog oe 
we find the parity symmetry 
P:  a+ar*, ¢-¢! 


(where for CP and P we also transform the coordinates as usual). 


As before, the vacuum (¢) = al breaks the flavor symmetry to the diagonal 
subgroup U; = Ur, which commutes with parity (and is therefore no longer “chiral” ). 
It also gives masses to the quarks (since chiral symmetry is broken); this is a general 
feature of spinors coupled to scalars under spontaneous breakdown. In the limit 
A — co (where the mass of all bosons but the Goldstones becomes infinite, but the 
quark mass M = Am is fixed), the Goldstone bosons are described by the unitary 


matrix U, which transforms as U’! = U;'UUp. 


Exercise [VA4.1 


Rewrite this action according to the analysis of exercise [VA3.1b: 
a Separate the Goldstone bosons from the massive scalars. 
b Replace the G-representation quarks with the H-representation quarks. 


An interesting special case is m=1 (one flavor). The Goldstone boson of axial 
U(1) can be identified with the 7°. In the limit A — oo, the Lagrangian becomes 


(with a tr no longer needed) 
L= (q'3i0" san, + dp 108° q* ra) - 4m? (Or)? - we (ed Aha + eg as) 


writing a for the neutral pion field. A = M/m is still the coupling of the pion to 
the quarks, as can be seen by rescaling 7 — 7/m to give the kinetic term the usual 
normalization. (The coupling m is known as the “pion decay constant” , and is usually 
denoted f,. If we include leptons with the quarks, then this coupling also describes 


the decay of the pion into two leptonic fermions.) 


In this case, the (broken) axial U(1) transformations are 
q=e"q, dr=e'gr, 1 =7—20 
The corresponding axial current (determined, e.g., by coupling a gauge vector) is 


Te = (Chat — deg p) — On 


A. HIDDEN SYMMETRY 243 


This current is still conserved, since the field equations aren’t changed by the prop- 
erties of the vacuum. The linear term is characteristic of expanding the Goldstone 
field about the spontaneously broken vacuum; it corresponds to the fact that that 


field has an inhomogeneous transformation under the broken symmetry. 


However, in reality the pion is not exactly massless, so we should add to the 
previous action a mass term for the pion, which explicitly violates the symmetry. (It 
is then a “pseudogoldstone boson”.) In the general chiral symmetry model, where the 
Goldstone bosons are described by a unitary matrix, a simple term that gives them 
masses while preserving the polar (parity-preserving) diagonal symmetry U; = Up of 


the vacuum is, for some constant €, 
Lm = —€ tr(+ of — Vm!) 


Since this explicitly breaks the axial U(m) symmetries, the corresponding currents 


are no longer conserved. In the U(1) case, we can also add just a mass term 


(for some constant ¢), which is the leading contribution from the general term above. 


The change in the field equation for 7 now violates the conservation law as 


This explicitly broken conservation law is known as “Partially Conserved Axial Cur- 
rent” (PCAC). 


5. Stuckelberg 


By definition, only gauge-invariant variables are observable. Although in general 
a change of variables to gauge-invariant ones can be complicated and impractical, 
there are certain theories where such a procedure can be implemented very simply 
as part of the normal gauge-fixing. Not surprisingly, the only nonlinearity in these 


redefinitions involves scalars. 


The simplest cases of such redefinitions are free theories, and are thus contained 
in our earlier discussion of general free, massive gauge theories. The simplest of these 
is the massive vector. As described in subsection IIB4, the Lagrangian and gauge 
invariance are 

L = iF? + 4(mA + 06)" 


5A=—-O, d56=m) 
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where F,, is the Abelian field strength. Note that the scalar is pure gauge: It is 
called a “compensator” for this gauge invariance. Since it has a nonderivative gauge 
transformation, it can easily be gauged to zero at each point, by just choosing \ = 
—g/m. This means that without loss of generality we can consider the theory in 


terms of just the gauge-invariant field 
A=A+t -0¢ 


This “composite” field can also be considered as a field redefintion or gauge transfor- 


mation on A. The lagrangian simplifies to 
1pv2 , 1,2 are 
L = a -+ qm x 


Later we'll see that it is often more useful to keep @ as an independent field. 


Exercise IVA5.1 
Choose the gauge A° = ¢. Show that ¢ then can be eliminated by its equa- 
tion of motion, leaving only the transverse 3-vector A’, with Lagrangian 
—;A'(O—m?)A’. Show the relation to the lightcone gauge of subsection 
I1IC2, using the dimensional reduction langauge of subsection IIB4. (Hint: 
You might want to use the fact that f? @ f(m— 09) f/m using integration 
by parts.) 


The original Lagrangian can also be considered an unusual coupling of a massless 
vector to a massless scalar: Remember that the massless scalar is the simplest example 


of a Goldstone boson, with the spontaneously broken global symmetry 
6¢=€T¢, To= 1 


where we have defined the symmetry generator T’ to act inhomogeneously on ¢. We 
then couple the “photon” to this charge: After a trivial rescaling of the gauge field, 
L= ,F? + 4(V¢), V=0+AT 


8m? 


where m is the “charge” with which A couples to ¢, which in this case happens to 


have dimensions of mass. The electromagnetic current in this case is simply J = sVé4, 


whose conservation is the scalar field equation O¢ = 0 (with gauge-covariantized LD). 

Because the spontaneously broken symmetry of the corresponding Goldstone 
model is now gauged, expanding about (¢) = 0 is no longer a physical statement 
about the vacuum, since ¢ is no longer gauge invariant. (As we saw, we can even 


choose @ = 0 as a gauge condition.) Therefore, from now on, when we make a 
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statement such as “(¢) = 0” in such a case, it will be understood to refer to choos- 
ing ¢ = 0 as the value about which to perform perturbation expansions (e.g., for 


separating actions into kinetic terms and interactions). 


Note that the Stiickelberg action can be generated starting from the action with 


just A’, and performing a gauge transformation that is not an invariance: 
! / 1 
A’ > A’ + — 06 


Dropping the prime from A, this transformation is just the inverse of the one we used 
to eliminate the scalar. If we start from an action that has also a coupling of A’ to 


matter, we see that conserved currents decouple from @: 


[aro fas-d foods 


More precisely, if the only term in the action for vector + matter that is not gauge 
invariant is the vector mass term (4m?A”), then the above gauge transformation 


affects only that term. 


6. Higgs 


We have seen that spontaneous symmetry breakdown can generate masses for 
spinors. We also saw how a massless vector could become massive by “eating” a 
would-be Goldstone scalar, in the simplest case of a scalar without self-interactions. 
We'll now examine more interesting models: Yang-Mills theories, which describe self- 


interacting vectors, must couple to self-interacting scalars to become massive. 


We can expect, by considering the linearization of any Yang-Mills theory coupled 
to scalars, that we will need more scalars than massive vectors, since each vector needs 
to eat a scalar to become massive, and some scalars will become massive and thus 
uneaten. (Only would-be Goldstone bosons can be eaten, as seen by linearization to 
the Stiickelberg model.) For the simple (and most useful) example of U(n) for the 
gauge group, an obvious choice for the scalar “Higgs” field is an n@n matrix. (SU(n) 
can be treated as a slight modification.) The simplest such model is the one studied 
in subsection IVA2: We now consider one of the SU(n) symmetries (together with 
the U(1)) as the local “color” symmetry to which the Yang-Mills fields couple, and 
the other SU(n) as the global “flavor” symmetry (where we use the names “color” 
and “flavor” to distinguish local and global symmetries, not necessarily related to 


chromodynamics). 
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The Lagrangian for this “Gervais-Neveu model” is then 
L = tr[gaF? + 5(V9)! - (Vo) + g°(b'g — 5m7T)I 


where V = 0+7A, and A, and ¢ are n@n matrices (but A, are hermitian). Now ¢'¢ 


is gauge invariant (although not invariant under the flavor group), so we still have 
(g'd) = gm’I 


as a gauge-invariant statement (but (¢) = al, or (o¢') = 4m?I, still makes sense 


only for purposes of gauge-dependent perturbation expansions). 

Since any complex matrix can be written as ¢ = UH/\/2, where U is unitary and 
H is hermitian, we can choose the “unitary gauge” U = I (ie., ¢ = ¢"). As for the 
Stiickelberg case, this is equivalent to working in terms of the gauge-invariant fields 


(defined by using this U as a gauge transformation) 
epi — ig _fpy-l 
A’ = U~*(—id + A)U, ¢ =H =U @ 
where U can be defined by 


SzH = Void, U=$V2H™ 


This is well-defined as long as H is invertible, which is true for small perturbations 
about its vacuum value 
CH y= ont 


If the perturbation is so large that H has vanishing eigenvalues, then this is equivalent 
to looking at states so far away from the vacuum that some of the broken symmetry 


is restored. Expanding about the vacuum (H — mI + H), the Lagrangian is now 
L=trlggF? + 4m? A? + 40H)? + Pm? H? 


+A’. +(HiOH) + 5mA7H + 5\°mH? + $A°H? + EH 

Thus all particles are now massive. As for the Goldstone case, we can take the limit 
A — oo to get rid of all the massive scalars, which in this case leaves just the massive 
vectors, adding only the mass term to the original Yang-Mills action. This was clear 
from the nonlinear 0 model that resulted from that limit, by coupling that field (U) 


to Yang-Mills directly. 


Exercise IVA6.1 
Find the chiral action for this model of the type described in subsection IIIC4, 


where the massive vectors are described by self-dual tensors instead of vectors. 
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Exercise IVA6.2 
Consider again this model, for the case n=2. We modify this example by 
dropping the U(1) gauge field, so we have just SU(2). Since SU(2) is pseu- 
doreal, we can further restrict the Higgs field to satisfy the reality condition 
o* = C¢C. Thus, both color and flavor groups are SU(2), and ¢ is the usual 
matrix representation of the 4-vector of SO(4)=SU(2)@SU(2) (see subsection 
IIA5). Repeat the analysis given above. 


Exercise IVA6.3 
Consider again the gauge group SU(2), but now take the Higgs field in the 
adjoint representation, with no flavor group (i.e., a real 3-vector). Show that 
only 2 of the 3 vectors get mass, leaving a residual U(1) gauge invariance. 
Explain this in terms of the gauge transformations of the 3-vector. (Hint: 
think 3D rotations.) 


7. Dilaton cosmology 


Some of the ideas in general relativity can be introduced by a simple model that 
involves introducing only a scalar field. Although this model does not correctly de- 
scribe gravitational forces within our solar system, it does give an accurate description 
of cosmology. The basic idea is to introduce a dynamical length scale in terms of a 


real scalar field ¢(x) called the “dilaton” by redefining lengths as 
—ds? = dz™dx"¢(r)Nmn 


(Squaring ¢ preserves the sign of ds?; we assume @ vanishes nowhere.) As explained 
in our discussion of conformal symmetry, this field changes only how we measure 
lengths, not angles (which is why it is insufficient to describe gravity): At any point 
in spacetime, it changes the length scale by the same amount in all directions. In 
fact, it allows us to introduce conformal invariance as a symmetry: We have already 
seen that under a conformal transformation the usual proper time of special relativity 
changes as 
ae ae ge Eade de Tay 


Thus, by transforming @ as 
(2) = (E(x) (a) 


we have 


ds’? = ds? 
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for our new definition above of proper time. This transformation law for for the 
dilaton allows any Poincaré invariant action to be made conformally invariant. This 


definition of length is a special case of the general relativistic definition, 
=I = d0" de Galt) SS Bin =O ee 


The action for a particle is easily modified: For example, 
5. = fe s(um? —v'#) fer slum? — uv"? (x) a7] 


since £2 = dx?/dr? (or by using our previous coupling to the metric tensor gmp). It 


is convenient to rewrite this action by redefining 


u(r) > v(7)¢*(x(7)) 


The resulting form of the action 
Ss, > fe s(um?¢?(x) — v*4?) 


makes it clear that there is no change in the case m = 0: A massless (spinless) 
particle is automatically conformally invariant. We have seen this action before: It is 
the coupling of a massless particle to an external scalar field 4m?¢?. (What we call 


the scalar field is irrelevant until we write the terms in the action for that field itself.) 


Exercise IVA7.1 


Let’s examine these actions in more detail: 


a Find the equations of motion following from both forms of the particle action 
with background dilaton ¢(x). 

b Find the action that results from eliminating v by its equation of motion from 
both actions for m 4 0, and show they are the same. 

c By a different redefinition of v, find a form of the action that is completely 
linear in @. 


The corresponding change in field theory is obvious if we look at the Hamiltonian 


form of the particle action 
Sy o> [ar-a"on + v3(p? +m’¢d’)| 


Using the correspondence principle, we see that the Klein-Gordon equation for a 
scalar field w has changed to 
(O — m*¢*)y = 0 
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The corresponding modification to the field theory action is 
Sof aPa (Gu)? + mtgur 


Since conformal invariance includes scale invariance, it is now natural to associate 
dimensions of mass with ¢ (or inverse length, if we do classical field theory) instead 
of m, since in scale invariant theories all constants in the field equations (or action) 
must be dimensionless (otherwise they would set the scale). Similarly, this makes ds? 


dimensionless, reflecting the fact that it is now scale invariant. 


Since this is supposed to describe gravity, at least in some crude approximation 
that applies to cosmology, where is the (Newton’s) gravitational constant? Since @ 
must be nonvanishing, “empty space” must be described by ¢ taking some constant 


value: We therefore write 


where “( )” means vacuum value, or asymptotic value, or weak-field limit (the value 
@ takes far away from matter). (We have chosen an extra factor here of 7 in the 
definition of Newton’s constant G for later convenience, so we effectively use units 
G =. Its normalization can’t be determined without introducing true gravity. 
Similarly for the 3, which simplifies things for cosmology, but differs from our later 
conventions.) Thus, the usual mass in the Klein-Gordon equation arises in this way 
as my/3/k. The dilaton @ is thus defined as the field that spontaneously breaks 
scale invariance, and also as its Goldstone boson. Unfortunately, things are more 
complicated in cosmology, since then ¢ is time dependent, even though it’s not space 
dependent. But physical quantities are scale invariant, just as they are rotationally 
and translationally; thus only (dé/dr)/¢ (the “Hubble constant”: see below) and its 


7 derivatives are measurable. 


In natural (“Planck”) units « = 1 (i.e., G = 7; or some other convenient value): 
Fixing c = h = « = 1 completely determines the units of length, time, and mass. 
These units are the convenient ones for quantum gravity; they are also the most 
obvious universal ones, since special relativity, quantum theory, and gravity apply to 
everything. However, they are presently impractical in general, since the gravitational 


constant is not so easy to measure: Its presently accepted value is 
G = 6.6742(10) x 107 ''m?kg"ts~? 


(where the numbers in parentheses refer to errors in the last digits), which is accu- 


rate to only a few parts per 10,000, compared to the standard atomic and nuclear 
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constants, which are known to a few parts per 100,000,000. On the other hand, cos- 
mological measurements are even less accurate, so we can use them there: The orders 


of magnitude seem inappropriate, but interesting. 


In relation to standard units, the Planck units (adjusted to our units G = 7) are 


Gh 

= 9.11867(69) - 10-*°m 
m3 
Gh 
— = 3.04166(23) - 10-“*s 
TO? 
hen 28 
ere 85786(29) - 10~8kg 


Exercise IVA7.2 
There is another Planck unit, for temperature. Evaluate it in standard units 


(Kelvins) by setting to 1 the Boltzmann constant k. 


We have yet to determine the action for ¢ itself: We write the usual action for a 
massless scalar in D=4 (for other D we need to replace @ with a power by dimensional 


analysis), up to normalization, 


oe - f de Lag)? 


but we have written it with the “wrong” sign, for reasons we cannot justify without 
recourse to the complete theory of gravity. However, without this sign change we 
would not be able to get cosmological solutions with positive energy density for source 


particles (matter and radiation without self-interaction). 


To a good approximation the universe can be described by a spacetime which is 
(spatially) rotationally invariant (“isotropic”) with respect to a preferred time direc- 
tion. Furthermore, it should be (spatially) translationally invariant (“homogeneous” ), 
so the dilaton should depend only on that time coordinate. We therefore look for so- 
lutions of the equations of motion which depend only on time. Thus the proper time 
is given by 

—ds’ = ¢° (é)[—-dé + (dz’)”| 
By a simple redefinition of the time coordinate, this can be put in a form 


—ds? = —dr* + ¢?(r)(dz')’ 


where by “(7)” we really mean “¢(t(7))”, and the two time coordinates are related 


by 
=i oa => r= f dt ott) or t= fars a CGy 


A. HIDDEN SYMMETRY 251 


In this latter form of ds we can recognize T as the usual time, as measured by a clock 
at rest with respect to this preferred time frame. It will prove convenient to calculate 
with time t, so we will work with that coordinate from now on, unless otherwise 
stated; in the end we will transform to 7 for comparison to quantities measured by 


experiment. 


To a good approximation the matter in the universe can be approximated as a 
“dust” , a collection of noninteracting particles. It should also be rotationally invariant 
with respect to the preferred time direction, so the momenta of the particles should 
be aligned in that time direction. (Really it is this matter that defines the time 
direction, since it generates the solution for ¢.) Furthermore, the dust should be 
translationally invariant, so all the momenta should be the same (assuming they all 
have the same mass), and the distribution should be independent of time. (Here, 
unlike general relativity, we do not think of spacetime itself as changing: We treat 
spacetime as ordinary Minkowski space, and @¢ as another field on it.) Varying the 


Hamiltonian form of the action for a single particle with respect to ¢, we find 


65m _ 4p —X 
ieee) m fer vo (x ) 


(We briefly use 7 again for the worldline parameter, not to be confused with the 
physical time 7 just introduced.) Using the equations of motion following from that 


action, we also have 


umd = Va = 


dt 
dt 


where we have used dx’ = 0 for this dust, and the fact that v,m, ¢ are all positive by 
definition. We thus have 59 
M 3 
— = m0°(4 — X 
p(y) 


We can compare this to the energy density, derived as in subsection IIIB4 (since the 


X? term in the action, which would contain the metric, is unmodified): for matter 
= fe O(2 —X)u 1? = fe 6*(24 — X)um?¢? = mod?(x — X) 


as we could guess from dimensional analysis. The relation between these 2 quantities 
is no accident: Our original introduction of ¢ was as Gmn = ¢7mn. If we introduce 
both ¢@ and metric independently, so as to calculate both of the above quantities, in 


the combination ¢2gmn, then we automatically have 
5S 5Su : 
= 29mnz—— = —I um 


ba ON ae 


which is T° in this case (since the other components vanish). 
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Of course, the dust consists of more than one particle: It is a collection of parti- 
cles, each at fixed x’. That means we should replace 6°(x2 — X) with some constant, 
independent of both x’ (because of homogeneity) and t. (Because of isotropy, the 
particles don’t move. In this interpretation of the expanding universe, we thus have 
“static” particles whose separation increases: Although x’ is constant for them, dis- 
tance is measured with an extra factor of ¢.) Actually, we need to average over 


particles of different masses: The result is then 


0S 00 
— =4, a, =e 
60(2) M @ 
for some constant a. The equations of motion for ¢ are now very simple; since 0;¢ = 0, 
we now have simply 
b= 
where the dots now refer to t derivatives. If we take this equation and multiply both 


sides by ¢, we get an obvious total derivative. Integrating this equation, we get 
20° = ab+ gb 


for some constant 4b. This equation has a simple interpretation: Recognizing ad as 
the energy density T?? of the dust, and —1¢? as the energy density of @ (from our 
earlier discussion of Hamiltonian densities), we see it implies that the total energy 


density of the Universe is a constant. 


We can also identify the source of this constant energy: We evaluated the en- 
ergy density of dust and its coupling to @. However, there can also be radiation: 
massless particles. As we saw, massless particles do not couple to ¢. Also, we have 
neglected any interaction of particles with each other. Thus massless particles in this 
approximation are totally free; their energy consists totally of kinetic energy, and 
thus is constant. (They also move at the speed of light, so components of T@ other 
than T° are nonvanishing. However, we average over massless particles moving in 
all directions to preserve isotropy.) Therefore we can identify the energy density for 
radiation, 

TH — 1p 


Exercise IVA7.3 
Consider general forms of the energy-momentum tensor that have the right 
symmetry: 
a Show that the most general form that has spatial isotropy and homogeneity 
is 
T™ = p(t)u™u” + P(t)(n™ + u™u”), u” = of 
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(or the equivalent). p is the energy density, while P is the pressure. This 


general form is called a “perfect fluid” (e.g., an ideal gas). 


b Show that the equation of motion for @ and energy conservation are now 
oo=p-3P, 30° =p 
Relate pressure to energy density for radiation by using the fact that it doesn’t 


couple to @. 


c Derive from these the “covariant conservation law” 
op + (3P — p) =0 


These equations are easily solved. There is an unavoidable “singularity” (the 
“Big Bang”) ¢ = 0 (all lengths vanish) at some time: Imposing the initial condition 
6(0) = 0 (ie., we set it to be t = 0) and (0) > 0 (so ¢ > 0), 


o = akt? + Vbt 
The “physical” time coordinate is then 
_ ere. 142 


Since ¢ can’t be expressed simply in terms of 7, we use the expressions for both in 


terms of t. Simple expressions can be found for a = 0 (¢@ ~ /T) and b= 0 (¢~ 77/9), 


For the case of pure matter (b = 0), the energy conservation equation written in 


terms of the tT coordinate becomes, using dt = ¢ dt, 


do : a 
ie) - 37" 


This is the same as the Newtonian equation for the radial motion of a particle under 
the influence of a fixed point mass (or the relative motion of 2 point particles), with 


total energy zero. 


Since ¢ increases with time, distances (as measured by ds) between slowly moving 
objects (such as the dust particles, but also the stars and galaxies to which they are 
an approximation) also increase. This is true in spite of the fact that such objects 
are not moving with respect to the natural rest frame. The most obvious effect of 
this cosmological expansion is the cosmological “red shift”. The expansion of the 
universe causes photons to lose energy, including those of the black-body radiation of 


the universe as well as those emitted long ago from distant sources. 
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Since the Universe is approximately translation invariant in the spatial directions, 
spatial momentum p’ is conserved. (For example, vary the particle action with respect 


to z'.) This tells us nothing for the dust, but for the radiation we still have 
0 =p" = ey ees (p')? 


and thus F also is conserved. But this is FE as defined with respect to t, not T. 
(For example, it appeared in the action as tZ. Also, the above equation is for p™ = 
v_tdz™/dr with dx? = 0.) However, the time measured by clocks at rest is 7, and thus 
the energy E that is measured is with respect to 7. In terms of canonical conjugates 


as defined in a Lagrangian or Hamiltonian, we see this as 
iE=7E => E=¢'E 


using dr = ¢ dt. In particular, for the dust particles we have E=m. 


Actually, this is true for all components of the (4-)momentum: At any fixed 
point x”, we always choose coordinates near that point such that the proper time 
looks like the usual one, i.e., (2) = 1. This can always be accomplished by a 
scale transformation: Since we have conformal invariance, we are allowed to choose 
a reference frame by not only choosing an origin (translation) and orientation of the 
axes (Lorentz transformation), but also the scale (and even acceleration, via conformal 
boost). Rather than make this scale transformation explicitly, we simply note that 


the measured momentum is actually 
p™ = db tp™ 


For example, for massive particles we then have p? + m? = 0. 


Since E is conserved but E is measured, we thus have E ~ ¢~!. Therefore, 
observers measure the photon’s energy, frequency, and corresponding black-body ra- 
diation (whose distribution depends only on energy/temperature) as having time 
dependence ~ ¢~! (and wavelength as ¢). The spectrum of radiation emitted by a 
distant object is then shifted by this energy loss, so the amount of shift determines 


how long ago it was emitted, and thus the distance of the emitter. 


Similar remarks apply to observed energy densities: When using variations with 
respect to external fields, we used 64(a2 — X)’s: For the observer’s coordinates, this 
will be multiplied by ¢~4 (since dx is multiplied by ¢). Thus the observed energy 
density is 

p=T =T%g-4 = ad + lyg-4 
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Astronomers use (at least) 3 parameters which are more directly observable. The 
“size” of the Universe ¢ is coordinate dependent, but we can measure the change 
in time of this scale through red shifts: Comparing lengths at different times, we 
measure (72) /(71), more conveniently represented in terms of the difference of the 


In: In terms of the derivative, we have 


ae 


or 


The “Hubble constant” H (constant in space, not time) measures the expansion rate, 
and gives an inverse length (time) scale. Thus it is not predicted, but determined from 
observations. As for all cosmological quantities, it is difficult to measure, its value is 
based on various astrophysical assumptions, and its quoted value has changed often 


and by large amounts over the years. A recent estimate for its present value is 
H~' = 13.8(7) - 10° yrs. 
In “natural (Planck) mits,” c= G/r=fh=1, 4 = 143(7) x10". 
We can also define a dimensionless “(energy) density parameter” 2 by using H~! 
as a length scale: However, in the simplified model we have used, it is already fixed 
_ 2% 
~ OH? 
(Sometimes the parameter 0 = {2/2 is used instead.) Note that in our conventions 


spatial integrals are weighted as [ d?-!x/(2r)?/?; thus the relation of our density to 


=I 


the more standard one is 

8 

3 GPusual 
H2 


where G = 7 in our conventions (but sometimes G = 1 is useful, especially for 


p = (29)? Pasa => Q= 


solutions describing stars and planets). In the more general (relativity) case, this 
parameter measures energy density with respect to the amount needed to “close” the 
universe; in this case, it takes the “critical” value, bordering between open and closed. 
However, this value agrees with observations to within experimental error. This alone 
shows that the dilaton is sufficient to give an accurate cosmological model (although 


ingredients other than those discussed so far may be needed). 


The rate of change of the Hubble constant can be defined in terms of a dimen- 
sionless quantity by comparing its inverse with the true time: 
_ d(H") 
~ dr 
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or 


_ 6 d¢/dr? 

~ — (db/dr)? 
The “deceleration parameter” gq tells how fast the expansion rate is slowing down. In 
the case of pure dust g = 3, while for pure radiation g = 1; otherwise, it’s somewhere 


in between. 


Exercise IVA7.4 
Calculate H, q and 2 in terms of a, b, and t. 


Recent supernova observations (together with the assumption of the supernova 
as a “standard candle”) indicate that q is negative: The expansion is accelerating. 
Although this “experimental” value is highly unreliable, and its estimate varies widely 
from year to year based on methods of measurement and choice of assumptions (as 
well as author), the existence of measurements indicating q < 5 suggests the above 
model of energy coming from just dust and radiation may be too simple. In fact, other 
observations indicate the vast majority of energy in the Universe (about 95%!) is not 
in any known form. While some forms of proposed missing matter (“dark matter” ) 
seem to fit into the above types (but are simply not observed by non-gravitational 
methods), others (“dark energy”) do not, and seem to form the majority of the 
missing energy. One simple remedy is to introduce a “cosmological constant” term 


(or its equivalent) into the action: In the language of the dilaton, it takes the form 
S A= A / d‘x p* 


where A is the cosmological constant. This term preserves conformal invariance. (Its 
scale invariance is obvious by dimensional analysis.) Unfortunately, it makes the 
dilaton field equation nonlinear, so we no longer have a simple closed solution as 
before. (Numerical methods are required.) Furthermore, the observed value of this 
constant corresponds to a length scale of the order of the size of the observed Universe. 
While this can be explained for the Hubble constant, since it varies with time, there 
is no “natural” way to explain why a true constant should just happen to set a scale 
comparable to the present value of the Hubble constant (i-e., there is an unexplained 
10° floating around). One possibility is that it is dynamically generated as a vacuum 


value of another scalar field, and thus might vary with time. 


Exercise IVA7.5 
Show explicitly that the cosmological term is invariant under a conformal 
boost. 

Various early features of the universe are not well explained by the model pre- 


sented so far, in particular, why this model works so well, i.e., why the universe is 
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conformally flat. Furthermore, the observed isotropy of the universe suggests an early 
period of the universe where all parts of the (now-observed) universe were causally 
connected so that they could interact in a way to produce this homogeneity. (The 
universe as described above would expand too quickly for this to happen, at least for 
the observable part of the universe.) The details of this earliest era are not well un- 
derstood, primarily because they involve physics at the Planck scale. There are also 
many models available: The most popular class of models is “inflation”, the theory 
that the universe expanded more rapidly initially; another class considers the period 
before the Big Bang (which may be modified to be less or not singular). On the 
technical level, the necessary properties required for such conditions can be described 
most easily by introducing an extra scalar field (“inflaton”) whose changing vacuum 
value has the effect of a time-dependent cosmological constant. This field might be 
either fundamental or composite, or even represent modified dynamics of spacetime 
itself (by eliminating the inflaton by its equation of motion to modify the action of 
the dilaton: see exercise IXB5.4). Unlike the “dark energy” problem, which would ef- 
fectively modify gravity at the cosmological scale, this problem would modify gravity 
at the Planck scale. 
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sSayeehoneanesaueas B. STANDARD MODEL .................... 


In this section we discuss the “Standard Model”, the minimal theory that de- 
scribes all the observed particles and forces (except gravity). We also consider some 
features of “Grand Unified Theories” (GUTs), an extension of the Standard Model 


that uses fewer multiplets. 


1. Chromodynamics 


One way in which particles naively described by the action can be hidden from 
observation is if the force binding them is too strong to allow them to exist freely. 
Such a condition is often called “infrared slavery” since this alleged property of the 
force is a long-range feature, preventing the constituent particles from escaping to 
infinity. This “confinement” is not a classical phenomenon, and its occurence even 
at the quantum level has not yet been proven. Therefore, in this section we’ll simply 
assume confinement, and describe the resultant symmetry properties, leaving the 


dynamical properties for later chapters. 


The assumption of “color” confinement is that the color forces are so strong that 
they bind any objects of color to other such objects; thus, only “colorless” states, those 
that are singlets under the color gauge group, can exist freely. Composite fields that 
are invariant under the local group transformations can be obtained by multiplying 
matter fields or Yang-Mills field strengths, perhaps using also covariant derivatives, 
and contracting all color indices. The color gauge group is generally assumed to be 
SU(n): usually SU(3), but sometimes larger n for purposes of perturbation in 1/n. 
Larger n is also used for unification, but in that case the Higgs mechanism is used to 


reduce the group of the massless vectors to SU(3) (times Abelian factors). 


Another feature of these confined states, to be considered later, is their geomet- 
rical structure. The observed spectrum and scattering amplitudes of the “hadrons” 
(strongly interacting particles) indicates a stringlike identification of at least the ex- 
cited states. (The ground states may behave more like “bags”.) This picture also 
fits in with confinement, since the spatial separation of the quarks and antiquarks in 
excited states would force the gluons that convey their interactions (and self-interact) 
to confine themselves as much as possible by collapsing into “strings” connecting the 
quarks. Thus, we describe a meson with an “open string”, with a quark at one end 
and an antiquark at the other. Similarly, an excited glueball would no longer be a 


ball, but rather a “closed string”, forming a closed loop. 
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We will need to reconsider also the discrete symmetries, C, P, and T, and their 
combinations. First of all, we note the “CPT theorem”: All local, hermitian, Poincaré 
invariant actions are CPT invariant. This is easy to see from the fact that CPT only 
changes the overall sign of the coordinates, which is effectively the same as changing 
the sign of each derivative, as well as giving a —1 for each vector index on a field. Since 
CPT also gives signs to dotted spinors and not undotted ones, we also get —1’s for 
vector combinations of indices on spinors (qpoap®: signs cancel when contracting spinor 
indices on pairs of dotted or undotted spinors). Thus, all these signs cancel because 
Poincaré invariance requires an even number of vector indices (in even numbers of 
dimensions, from contracting with 7», and €qpea). Alternatively, and even more simply, 
in D=4 we can attribute it to having even numbers both of undotted spinor indices 
and of dotted spinor indices, since we can define CPT by associating a sign with each 
dotted index (including those that appear as part of a vector index). Consequently, 


from now on we ignore T and consider only C, P, and CP. 


Although we have considered C (and thus CP) in the context of electromagnetism, 
nonabelian gauge fields require some (simple) generalization, since they carry charge 
themselves. We start with the general coupling of massless fermions to nonabelian 


gauge fields: 
L = pl? (48,5 + A,g)b* 


where w is a column vector with respect to the gauge group, and A a hermitian 
matrix. The CP transformation of the fermions then determines that of the vectors, 
needed for invariance: 

CP: wt=yra, Wag, Oy =O, AL = ATP 
(remember (w%)* = ~%, but (Wa)* = —Wx because of the factor of Cag), where we 
have chosen to represent parity on the coordinates as acting on the explicit 0 rather 
than on the arguments of the fields. The transformation on the vector is thus parity 
on the vector index, combined with charge conjugation A’ = —A? = —A,*: The 
minus sign can be associated with change in sign of the coupling (as for the Abelian 
case), while the complex conjugation takes into account the charge of the vector 
fields themselves. (As discussed in subsection IB2, G — —G*® is an invariance of the 
algebra, where g — g* and g = e’@.) Although these terms, as well as the F? term 
for the vectors, are CP invariant, this invariance can be broken by coupling to scalars: 


The Yukawa coupling 
ly = wba + he. 
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for some matrix ¢ of scalars, would require the CP transformation 


(up to perhaps some unitary transformation), but unlike the vectors there is no guar- 
antee that under complex conjugation the matrix @ = ¢'M, for real scalars ¢' and 


constant matrix (Yukawa couplings) M; will preserve this form, i.e., satisfy 
o”M; = ¢°M;* 


since the matrices M; can be complex. 


The basic assumption of “chromodynamics”, or in the quantized version “quan- 
tum chromodynamics (QCD)”, is that we have a nonabelian gauge theory without 
fundamental scalars that couple directly (but scalars will show up when we intro- 
duce electroweak interactions). Namely, we assume only Yang-Mills for the “color” 
gauge group SU(n), specifically n=3, with the usual action, minimally coupled to 
spin-1/2 “quarks” in the defining representation of the color group, which may have 
masses. (These masses are actually generated by weakly interacting Higgs bosons, 
whose coupling we consider in subsection IVB2; for now we include just the result- 
ing mass terms.) Such an action is automatically invariant under CP and T. We 
furthermore assume invariance under charge conjugation: Just as an irreducible real 
scalar describes particles that are their own antiparticles, and needs doubling (or 
complexification) to define charge, an irreducible (massive) spinor cannot describe 
distinguishable particle and antiparticle. But the quarks come in the defining rep- 
resentation of SU(n), which is complex, and thus requires doubling to define mass 
terms. Therefore, for every quark field gra (“L” for “left”) we have an “antiquark” 
field gra (“R” for “right” ), and they transform into each other under charge conjuga- 
tion, just as a scalar transforms into its complex conjugate. (A spinor can’t transform 
into its complex conjugate under C, since C commutes with spacetime symmetries, 
like Lorentz transformations.) Besides this doubling, and the n colors of the quarks, 
we also assume a further multiplicity of m different “flavors” of quarks. Gauge invari- 
ance requires the quark masses be independent of color, and C invariance requires the 
mass terms couple qz to qr, but these terms violate an otherwise global U(m)®U(m) 


flavor symmetry. 


The action is then of the form 


tr[LF? + (q'niVan + a'niVar) + (Bahan + h.c.)] 
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where we have written gz and gz as matrices with respect to SU(n) color (U.) and 
U(m)@U(m) flavor (Ur, and Uy) such that they transform as 


gd, = UeqrU fz, dp = U-*aRU 7 R* 
and thus the covariant derivatives can be written as 
VadL = (Ou = tAa)qz, VadR = (Ou _ iAa*)qr 


where A, are hermitian, traceless, nxn matrices. (By definition, charge conjugation 


takes a representation of an internal symmetry into the complex-conjugate one.) 


While the color symmetry is a local symmetry, the flavor symmetry is broken, 


inducing the transformation on the mass matrix 
= 
M' = Us,r.MU TR 


This transformation allows the mass matrix M to be chosen real and diagonal: Any 
matrix can be written as a hermitian one times a unitary one. A Urr transformation, 
as a field redefinition, then can be made to cancel the unitary factor in M; then 
a unitary transformation Us; = Usz can be made to diagonalize M (while keeping 
it hermitian). These diagonal elements are then simply the masses of the m dif- 
ferent quark flavors. The most symmetric case is MM = 0, which leaves the entire 
U(m)@U(m) chiral symmetry unbroken. (See subsection [VA4.) The least symmet- 
ric case is where all the masses are nonzero and unequal, leaving as unbroken only 
the subgroup U(1)”, with Us, = Ure. (In general, Ur, = Usp if all masses are 


nonvanishing. ) 


Exercise IVB1.1 
Show the most general case is the product of U(N)’s for various subspaces, 


with 2 U(N)’s for the massless subspace. 


Since the above transformation allows M to be diagonalized, in particular it can 


be made symmetric, which is sufficient to define charge conjugation: 
Ci GLa ~ dRa, Ag _ —A,* 


Furthermore, since M can be chosen not only symmetric but real, in particular it can 


be made hermitian, which is enough to define parity: 
Ps dL. — IR,Las Aga SAP 


The minimal form of this action, besides making CP and T automatic, also au- 


tomatically extends the discrete symmetry C to an O(2) symmetry, whose “parity” 
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transformation is C and whose continuous SO(2)=U(1) symmetry is the U(1) part 
of the U(m) flavor symmetry, which is not broken by the mass term. It corresponds 
to a charge called “baryon number”: Up to an overall normalization factor, it simply 
counts the number of quarks gra, ra (which form a Dirac spinor) minus the number 
of “antiquarks” ¢ra, Gra. However, such an O(2) symmetry can be defined separately 
for each flavor, since (after MM has been diagonalized) the action can be written as 
a sum of independent terms for each flavor. In particular, each flavor has its own 
separately conserved quark number. These flavor conservation laws, at the classical 
level, are broken only by the weak interactions, which we have not included in the 


above action. (Gravity and electromagnetism do not violate them.) 


Since confinement is a quantum effect, the details of hadronic scattering cannot be 
discussed within classical field theory. However, we saw that low-energy properties of 
mesons (and similarly for baryons) could be described by effective Lagrangians. The 
fact that hadrons are made of quarks can be used to obtain a bit more information 
even at the classical level, especially if the relevant quarks are heavy. (Heavy with 
respect to what is unfortunately also a question that can be answered only at the 
quantum level.) For example, in a nonrelativistic approximation, low-energy proper- 
ties of hadrons can be found from just the quantum numbers, spin-spin interactions, 
and masses of the quarks, while their velocities are ignored, and the gluons are ne- 
glected altogether. In such an approximation, reasonably accurate predictions are 


made for the masses and magnetic moments of the ground-state hadrons. 


Actually, the claim that color nonsinglet states can never be observed needs a 
bit of stipulation: There may be a “quark-gluon plasma” phase of hadronic matter 
that can exist only at extremely high temperatures or pressures. Thus, a hypothet- 
ical observer during the first moments of the universe might observe “free” quarks 
and gluons. Similarly, a small enough observer, living inside an individual hadron, 
might see individual quarks and gluons, since the size of his equipment would be 
much smaller than what we consider “asymptotic” distances. Conversely, we could 
consider the possibility of a new chromodynamic force, other than the one respon- 
sible for the hadrons of which we are composed, that has a confinement scale that 
is astronomical (extremely low energy), so that earthly laboratories would fit inside 
the new “hadrons”. Thus, any statement about the observability of color must be a 
dynamical one, and does not follow as an automatic consequence of the appearance of 
a nonabelian group: Just as for the Higgs effect, confinement can be repealed under 
appropriate circumstances, and the observability of color depends on the details of 
the dynamics, and in particular on the values of the various parameters (momenta 


and couplings). 
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2. Electroweak 


The weak and electromagnetic interactions are mediated by observed spin-1 par- 
ticles, some of which have charge and mass. Specifically (see subsection IC4), the 
massive vectors form a triplet (W*, W~, Z), while there is only one massless vector 
(the photon). This suggests a gauge group of SU(2)@U(1), with a Higgs effect that 
leaves only U(1) unbroken. From the table of known fundamental fermions, we can see 
that they fall into doublets and singlets of this SU(2), with the U(1) charge being that 
of electromagnetism. (This SU(2)@U(1) unification of the weak and electromagnetic 


interactions is called the “Glashow-Salam-Weinberg” model.) 


We saw in subsection IVA4 a very simple model of spontaneously broken chiral 
U(m)®U(m) symmetry where masses were generated for quarks. In subsection IVA6 
we saw how the same scalars could generate masses for vectors, by coupling to one 
of the U(m)’s. We now combine those two models, specializing to the case m=2, 
but with two slight modifications: (1) Since the defining representation of SU(2) is 
pseudoreal, we can impose a reality condition on the Higgs field, which is in the (3,5) 
representation of SU(2)@SU(2): 

e* = COC 


This makes it a vector of SO(4)=SU(2)@SU(2) (See exercises ITA5.3 and IVA6.2.) 
It’s also the reality condition satisfied by an element of (the defining representation 
of) SU(2). (See subsection IIA2.) This is not surprising, since the group product 
U’ = U,UUR allows the interpretation of a group element itself as a representation 
of chiral symmetry. This is the situation described in subsection IVA2 (@ — U in the 
large-mass limit), but in this case '@ is automatically proportional to the identity 
(it gives the square of the 4-vector), so in general an SO(4) 4-vector can be written 
as the product of a scalar with an SU(2) element. This reality condition breaks the 
chiral U(1)@U(1) to the diagonal U(1) that leaves the Higgs invariant. 


(2) The gauged SU(2) is still one of the two chiral SU(2)’s, but the gauged U(1) 
must now be a subgroup of the other SU(2), since the Higgs is now invariant under 
the usual U(1)’s. Thus, the ungauged SU(2) is explictly broken, and this accounts 
for the mass splittings in the doublets of known fundamental fermions. Remember 
that observables are singlets of gauged nonabelian groups (except perhaps for Abelian 
subgroups), so any observed internal SU(2) must be a global symmetry, even when 
it’s broken. As described in subsection IVA6, these singlets can be constructed as 
composite fields resulting from the gauge transformation obtained from the SU(2) 


part of ¢. 
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Using the electromagnetic charges of the various particles, we thus determine 
their SU(2)@U(1) representations: For spin 1, we have W=(1,0) and V=(0,0), where 
the first entry is the “isospin” and the second is the U(1) charge. For spin 0, we 
have ¢=(4,+4), choosing the U(1) generator as the diagonal one from Ur. Finally, 
for spin 1/2, we have for the leptons /,=(3,—4), which combines with ¢ to pro- 
duce (0,0)@(0,—1), and /p=(0,1). Similarly, for the quarks we have q,=(%, 4), and 
qr=(0,—4 + 5). (We use for undotted spinors the convention “L” = fermion, “R” = 


antifermion.) The Lagrangian is then 
L=1,+I1o0+ Liye 


Ly = ga F?(V) + gotr F*(W) 
Lo = tr[3(Vo)'(Vo) + g"(b"b — 5m”)”] 


Layo = tr(WiVy) + tr (a ) grqtd + Alplrd @ + he 


where the fermions W = (qz, dr, lz, lz), and the SU(2)@U(1) covariant derivative acts 


as 


Vo = 06 + Wo — 15V6(5 *) 
Van = Oq1 — iqneW + i§V qn 
Var = O9n + igVar[—31 + (4°) 
Vip = Ol, —ilpW —igVI, 
Vir = dg t+iVle 


(The infinitesimal gauge transformations have the same form, dropping the deriva- 
tive term and replacing the gauge field with the corresponding gauge parameter.) 
For simplicity we have ignored the indices for color (and its gauge fields, treated in 
the previous section), families (treated in the following subsection), and spin. We 
have also used matrix notation with respect to the local SU(2) (gauged by W) and 
the global SU(2) (explicitly broken in Li/2 by the gauging of a U(1) subgroup, the 
Yukawa couplings, and the chirality of the massless neutrinos): Thus W is a traceless 
hermitian 2x2 matrix, ¢ is also 2x2 but satisfying the “reality” condition given above 
(traceless antihermitian plus real trace), gz, gr, and J; are 2-component rows, and 
lp is a single component. (By definition, the diagonal parts of W and ¢ are electro- 
magnetically neutral.) The quark Yukawa coupling is diagonal in the broken SU(2) 
to preserve the local U(1) symmetry. (The tr here is trivial for the lepton Yukawa 
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term, but we have left it for generalization to more than one family.) Explicitly, we 


can write 


and for the lightest family (see subsection IC4) 
qu = (dz uz), dr = (dr ur), I; = (ex v), lIR=erR 
In the unitary gauge for the local SU(2), 
G= Fel,  (~e)=m 
where y is a single real scalar, the simplifications to the Lagrangian are 


Ly > 7(0p)? + ge" tr{[W — 9V (4 PR + gr" (—? — m?)? 


; A, 0 1 


We then can expand y about its vacuum value m: The lowest order terms give masses 
for most of the vectors and fermions: The massless fermions are the neutrinos, while 
the massless vector gauging the unbroken U(1) (a combination of the original U(1) 
with a U(1) subgroup of the SU(2)) is the photon (of electromagnetic fame). The 
mass of the remaining vectors accounts for the weakness and short range of the “weak” 


interactions. 


Exercise IVB2.1 
Diagonalize this Lagrangian with respect to the mass eigenstates. For conve- 


nience, normalize 
1 / 
9 = 3390008 Ow, = g = gosin Ow 


where Oy is the “weak mixing (Weinberg) angle”. 
a Find explicitly the masses for all the particles in the Standard Model (first 
., A. Show from 
the experimental values for the vector masses given in subsection IC4 that 
sin’Oyw & .223. 


family for fermions) in terms of the couplings m, A, go, dw, A 


b Find all the other couplings of the mass eigenstates. Show that, with the 


conventional electric charge assignments, 
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(Hint: Rather than rescaling the vectors, note that the generated mass term, 


and the given couplings of V and W, suggest defining 
W=W-3V(5°) = VaeythZ, V2W =74+hkoZ 


in the conventions of subsection IIA1, for the new fields Z and photon 47, for 


appropriate constants k;.) 


Note that, unlike the strong (chromodynamic) or purely electromagnetic (or even 
gravitational) interactions, the weak interactions break every discrete spacetime sym- 
metry possible. (The others break none. CP violation will be discussed in the 
following subsection. Of course, CPT is always preserved.) Sometimes this is at- 
tributed to the presence of a chiral symmetry, used to reduce 4-component spinors to 
2-component; however, we have already seen that in general chiral and parity symme- 
tries are unrelated. (You can have either without the other. This fact will be further 
discussed in subsections IVB4 and VIIIB3.) A better explanation is to attribute 
P and C to doubling, which converts spinors from 2-component to 4-component: 
2-component spinors are the simplest description of helicity/spin $3 4-component 
spinors are useful only to manifest a larger symmetry, when it exists. The weak inter- 
actions violate parity because the neutrino is not doubled, and because the fermions 


that are doubled no longer have a symmetry relating their two halves. 


3. Families 


In the Standard Model (and its simpler generalizations) there is no explanation 
for the existence of more than one family of fermions. However, the existence of 3 
families does have interesting consequences. Most of these follow from the form of the 
Yukawa couplings, and thus the fermion masses. In subsection IVB1 we considered 
redefinitions of the fermion fields as unitary flavor transformations. These allowed us 
to obtain the simplest form of the mass matrices, since they were not flavor singlets, 
and thus transformed. We now perform similar transformations, but only on the 
family indices, since transformations that don’t commute with the gauge symmetries 
would complicate the other terms in the action. Now ignoring spin, color, and local 
flavor indices, and using matrix notation for the family indices, the fermions transform 


as 


=e. Gre galas, t,—0g; Ip] lRUiR* 


where gz, dr+, /z, and lp have m components for the m families. qgr+ are the 2 


components of the (explicitly broken) global flavor doublet gz. We thus have 5 U(m) 
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symmetries, all broken by the Yukawa couplings: These field redefinitions induce 


transformations on them, 


At _ qLAg U ype: A = UipAUR 


As in subsection IVB1, Ujr+ and Ur can be used to make A+ and A hermitian. 


Then Uj, can be used to make A diagonal, also as in subsection IVB1, leaving a U(1)™ 
symmetry Uj, = Uir, corresponding to separate conservation laws for electron number 
(including its neutrino), muon number, and tauon number for the 3 known flavors. 
However, the quark sector works a bit differently: We can use Uy, to diagonalize A, 
or A_, but not both. This leaves another U(1)” symmetry Ug, = Ugr+ = Ugr_. If A+ 


has been diagonalized, then 1 of the m U(1)’s, corresponding to total quark number 


(baryon number) conservation, leaves A_ invariant, while the remaining m—1 U(1)’s 
can be used to eliminate some of the phases of the complex off-diagonal components 
of A_. 


The remaining global flavor symmetries are thus m lepton U(1)’s and 1 quark 
U(1). The remaining Yukawa couplings are the real, diagonal A, describing the m 
masses of the massive leptons (the neutrinos remain massless), the real, diagonal 
A,, giving the m masses of half of the quarks, and the hermitian A_, consisting 
of m diagonal components, describing the masses of the other quarks, m(m—1)/2 
magnitudes of the off-diagonal components, and (m—1)(m—2)/2 phases of the off- 
diagonal components. These phases violate CP invariance: CP, besides its affect on 
the coordinates, switches each spinor field with its complex conjugate. Since the 
complex conjugate term in the action uses the complex conjugates of the A’s, this 
symmetry is violated whenever any of the components have imaginary parts (after 
taking into account all possible symmetries that could compensate for this, as we 
have just done). Note that CP is violated only for 3 families or more. (C and P 
are separately violated for any number of families by the SU(2)®U(1) coupling: As 
discussed in subsection IVB1, C invariance of the strong interactions is the symmetry 
di < Gr.) Since we can choose to transform away the phases in the subsector of the 
2 lighter quark families, the large masses of the heavier quarks suppress this effect, 


accounting for the smallness of CP violation. 


Since observed particles are mass eigenstates, it’s convenient to perform a further 
unitary transformation (the “Cabibbo-Kobayashi-Maskawa matrix”) that diagonal- 
izes the mass matrix. Although this is clearly possible by the arguments of subsec- 


tion IVB1, it is not part of the unitary transformations considered in this subsection 
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because it does not commute with the SU(2) gauge symmetry: After such a trans- 
formation, we find that the components of each SU(2) quark multiplet are linear 


superpositions of different families. 


Exercise IVB3.1 
Perform this diagonalization explicitly for the case m=2 (two families), using 
the two lightest families of quarks and leptons as listed in subsection IC4. 
Which particles mix? Parametrize this mixing by an angle @, (the “Cabibbo 
angle” ). 

An important experimental result with which the Standard Model is consistent is 
the suppression of “flavor-changing neutral currents (FCNC)”. The two electrically 
neutral gauge fields in this model, the Z and the photon ¥, couple to currents that are 
neutral with respect to the U(1) symmetries associated with each of the quark (flavor) 
numbers. This is true by construction before the unitary CKM transformation, but 
this transformation also leaves these two currents invariant (the “Glashow-Iliopoulos- 
Maiani mechanism”). Thus, at the classical level we do not see effects such as the 
decay K° — Z > pt, which would violate this “conservation law”. Furthermore, 
the quantum corrections are suppressed (though nonvanishing) for similar reasons: 
For example, the lowest-order nonvanishing quantum correction comes from replacing 
the Z witha WtW pair. Without the CKM matrix, this contribution would vanish; 
treating CKM, and its resulting contribution to quark masses, as a perturbtation, the 
resulting contribution is suppressed by a factor of me /m#,. The absence of FCNC is 


an important constraint on generalizations of the Standard Model. 


4. Grand Unified Theories 


The Standard Model gives a description of the weak and electromagnetic inter- 
actions that describes the spin-1 particles in terms of gauge fields, and accounts for 
all masses by the Higgs effect. However, it does not give any unification, in the sense 
that we still have 3 groups (SU(3), SU(2), and U(1)) for 3 interactions (strong, weak, 
and electromagnetic), and a large variety of spin-1/2 fields that are unrelated ex- 
cept by color and broken SU(2) flavor. Grand Unified Theories unify this symmetry 
by forcing all 3 gauge groups to be subgroups of a simple group, which is broken to 
SU(3)@SU(2)@U(1) by Higgs (and then broken to SU(3)@U(1) by more Higgs). This 
means introducing new spin-1 particles that are unobserved so far because of their 
very large masses. On the other hand, the known fermions are then grouped together 
in a small number of multiplets without introducing new fermions (except perhaps 


partners for the neutrinos to allow them to have small masses). Unfortunately, this 
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requires a more complicated (and ambiguous) Higgs sector, with separate spin-0 mul- 
tiplets and couplings for first breaking to SU(3)@SU(2)®@U(1) and then breaking to 
SU(3)®U(1); we won’t discuss those Higgs fields here. 


The simplest such model uses the group SU(5). Recall the SU(3)@SU(2)@U(1) 
representations of each family of fermions: 


qt = (3, 5) z)s qR+ = (3, 0, a) qr- = (3, 0, =F) i, = (1, on =5); lR= (1,0, 1) 


where the first argument is the dimension of the SU(3) representation (3 being the 
complex conjugate of the 3), the second is the SU(2) isospin, and the third is the U(1) 
charge. An SU(3)@SU(2)@U(1) subgroup of SU(5) can be found easily by taking 
the 5-component defining representation and picking 3 components as the defining 
representation of SU(3) and the other 2 for that of SU(2): Le., consider a traceless 


hermitian 5x5 matrix as an element of the SU(5) Lie algebra, and take 


Su(3) — 41 x U(1) 0 ) 


(SU(5)) > ( 0 SU(2) +42 x U(1) 


or in other words 
5 — (3,0,—%) @ (1,5, 5) 
From this we recognize the fermions as falling into a 5 @ 10, where the 10 is the 
antisymmetric product of two 5’s, which consists of the antisymmetric product of the 
two 3’s (a 3), the antisymmetric product of the two SU(2) doublets, and the product 
of one of each: 
5 7 (3, 0, s) QD er 3; —3) = dR+ D ly 
10 = (3, 0, —2) ® (1,0, 1) ® (3, 5; z) =dr- © lr D dr 


Exercise IVB4.1 
Find the symmetric product of 2 5’s, and its decomposition into representa- 
tions of SU(3)@SU(2)@U(1). 
A more unifying model is based on SO(10). A U(5) subgroup can be found from 
the spinor representation by dividing up the set of 10 Dirac y matrices into two 
halves, and taking complex combinations to get 5 sets of anticommuting creation and 


annihilation operators. (See exercise IC1.2.) The Dirac spinor is then 
(4 —3) D (5, —3) QD (10, —3) D (1 +5) ® (5, 3) ® (1, 3) 


in terms of the SU(5) representation and the U(1) charge. This Dirac spinor is 
reducible into Weyl spinors 16 © T6; in fact, i/2y_ is just (—1)*+!/? in terms of the 
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U(1) charge Y. (The SO(10) generators are even in oscillators, and thus do not mix 
even levels with odd.) We then have 


16 — (1,3) @ (10, —5) @ (5, 8) 


Ignoring the U(1) charge, these are the multiplets found for each family in the SU(5) 
GUT, plus an extra singlet. 


A simple way to understand this extra singlet is to look at a different path of 
breaking to SU(3)@SU(2)@U(1): Looking at the vector (defining) representation of 
SO(10), we can break it up as 6+4 (in the same way we broke up the 5 of SU(5) 
as 3+2) to get the subgroup SO(6)@SO(4)=SU(4)®SU(2)@SU(2). We can also see 
that a Dirac spinor of SO(10) (16616) will be a Dirac spinor of SO(6) (4@4) times 
(not plus) a Dirac spinor of SO(4), while the Dirac spinor of SO(4) is a defining 
representation of one SU(2) ((4,0)) plus a defining representation of the other SU(2) 
((0,3)). Thus, 

16 — (4, 4,0) @ (4,0, 3) 


where we have used the fact that y_, (used for projection to Weyl spinors) of SO(10) 
is proportional to the product of all the y-matrices, and thus the product of y_,’s for 
SO(6) and SO(4). 

Looking at this model (“Pati-Salam model” ) as an alternative to SU(5) (but with 
a semisimple, rather than simple, group, so it unifies only spin 1/2, not spin 1), we 
now look at breaking SU(4)-U(3)= SU(3)®@U(1) (using 4=3+1, as we did 5=3+2 
for SU(5)), and breaking one SU(2)— U(1). We then find 


(4,0, 3) > (3, 5,0, 3) @ (3, 3, 0, —5) ® (1, 1,0, 3) ® (1, -1, 0, —4) 


NIH 


= qr+ © Gr- Olea Ole 


where the arguments are the SU(3) representation, the U(1) charge from SU(4), 
the SU(2) isospin, and the U(1) charge from the broken SU(2). If we choose the 
U(1) charge of SU(3)@SU(2)@U(1) as —1/2 times the former of these two U(1) 
charges plus 1 times the latter, this agrees with the result obtained by way of SU(5). 
However, we now see that all the left-handed fermions are contained within one 
SU(4)@SU(2)@SU(2) multiplet, and the right-handed within another, but with a 
partner for the neutrino. Also, one of the SU(2)’s is that of SU(3)@SU(2)@U(1), 
while the other is the other SU(2) of the Standard Model, which was broken explic- 
itly there to U(1), whereas here it is broken spontaneously. Thus, there is a local 
chiral SU(2)@SU(2) flavor symmetry. 
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SU(5) SU(4)@SU (2), @SU(2)r 
NG / 106 GeV? 
SU(3)@SU(2),@U(1)r 
| #100 GeV 
SU(3)@U(1) 


Furthermore, the SU(4)@SU(2)@SU(2) model is invariant under C: In general, C 
is just a permutation symmetry. In this case, it simply switches the two multiplets 


of each family, 


Combining with the usual CP, this model is thus also invariant under P: 
Ps (4, 4,0) — (4,0, 5)* 1.€., (4, 3,0) — v4(4, 0, 5) 


But both C and P are broken spontaneously on reduction to the Standard Model. 
However, SO(10) lacks C and P invariance (contrary to some statements in the liter- 
ature), since there is only a single complex representation for each family of fermions 
(and thus no nontrivial C; of course, there is still CP, at least for the vector-spinor 
coupling, as always). In fact, the C of SU(4)@SU(2)@SU(2) is just an SO(10) trans- 
formation: Although SO(10) is not O(10) (which is why it lacks a C), it still includes 
reflections in an even number of “axes”, since reflection in any pair of axes is a 7 rota- 
tion (just as for SO(2)). Thus, breaking 10 — 6 + 4 includes not only SO(6)@SO(4), 
but also the reflection of an odd number of the “6” axes together with an odd number 
of the “4” axes — a combined “parity” of both SO(6) and SO(4). (They are all the 
same up to continuous SO(6)@SO(4) transformations.) This parity of the internal 


space is the C given above. (We saw a similar situation for O(2) in subsection IVB1.) 


The identification of C is somewhat semantic in a nonabelian gauge theory (ex- 
cept for unbroken U(1) subgroups), since it is defined by changes in sign of unobserved 
charges: The C appearing above at an intermediate stage of breaking of the SO(10) 
GUT originates as a global symmetry of only the Higgs sector, leaving all “funda- 
mental” particles with spin invariant. After breaking to SU(4)@SU(2)@SU(2), the 
vectors and the spinors are composites of the original ones and the Higgs responsible 
for the breaking, so they pick up this symmetry. (In the same way, the spinning 
particles of the Standard Model pick up the broken global SU(2) of its Higgs.) 
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Since GUTs unify quarks and leptons, they allow decay of the proton. However, 


since this requires simultaneous decay of all 3 quarks into 3 leptons, it is an extremely 


unlikely (i.e, slow) decay, but barely within limits of experiment, depending on the 


model. Proton decay is still unobserved: This eliminates the simplest version of the 
SU(5) GUT. 
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In section IIC we studied some general properties of supersymmetry in arbitrary 
dimensions, and its representations in D=4. We now consider 4D interactions, by 
introducing gauge fields defined on superspace, and their actions. A complete dis- 
cussion of supersymmetry would require (at least) a semester; but here we give more 
than just an overview, and include the basic tools with examples, which is enough for 
many applications. Quantum aspects of supersymmetry will be discussed in chapters 
VI and VIII, supergravity in chapter X, and some aspects of superstrings in chapter 
XI. 


1. Chiral 


We first consider some field equations that appear in all free, massless, super- 
symmetric theories. Of course, since the theory is massless it satisfies the massless 
Klein-Gordon equation by definition: 1 = 0. From our earlier discussion of general 
properties of supersymmetry, we also know that pqyh = pda = 0. These don’t 
look covariant, but noticing that pq differs from pd only by 60) terms (because of the 


index contraction), which already vanishes, we have the field equations 
pds® = ped, = 


These equations imply the Klein-Gordon equation, as seen by hitting them with 
another d and using the anticommutation relations {da, da} = Pop They imply 
stronger equations: By evaluating at 6 = 0, d,@ yields a spinor component field w,, 
and we find 

pbs = pv = 0 
the usual for massless spin 1/2. 


Another equation that can be imposed is the “chirality” condition 
dg? = 0 


where ¢ now refers to such a “chiral superfield” (and thus @ to an “antichiral” one, 
d.@ = 0). This requires that ¢ be complex, otherwise we would also have dad = 0 
and thus p@ = 0 by the anticommutation relations. The component expansion is 


given completely by just the d’s and not the d’s: 


@| = A, (d.)| — Vas (d*¢)| =B 
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where A and B are complex scalars, and we use the normalization 
2_ 1a 
d* = 5d"da, 


All other components are x-derivatives of these, since the d’s can be pushed past the 
d’s (producing p’s) until they annihilate ¢. Another way to state this is to use the 
fact 

i= e F/29, Ul? ds = eU/2§.e U2. fee 0°O"D 5 


to solve the chirality constraint as 
(a, 0,0) =e"? b(a, 8) 


where b is independent of 6: It is defined on “chiral superspace”. (In this equation U 
generates a complex coordinate transformation.) Another way to solve the chirality 
constraint is to use the covariant derivatives: Since dadgd, = 0 by anticommutativity 
(and similarly for d’s), 


dap=0 => =d*y 
where w is a “general” (unconstrained) complex superfield. It is the “prepotential” 
for the field ¢. 

Exercise IVC1.1 


Let’s analyze the supersymmetry generators gq, Jz in this case. 


a Find similar expressions for g,@ in terms of e4/?. 


b Find q,¢ on ¢ in terms of just 6 and 8/00 (no 6 nor 9/06). 
Exercise IVC1.2 


Show that the prepotential has a gauge invariance, under which ¢ is invariant. 
(Hint: Use the same identity that led to the prepotential. ) 


From the anticommutation relations we find 
[a*, d?] = pd, 
Since this must vanish on @¢, we find 
dal’o=di@bd=0 => Ppt =0 = @&¢=constant 


(We can safely ignore this constant, at least when considering the free theory: It 
corresponds to a term in the action linear in the fields.) This field equation, together 
with the chirality constraint, is sufficient to determine the theory: A is the usual free 


(complex) scalar, ¢, is the usual free spinor, and B is a constant. 
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To describe interactions of this (“scalar”) multiplet, we keep the chirality condi- 
tion, since that greatly simplifies the field content of the superfield. In fact, this is 
clearly the simplest off-shell superfield we can define, since it already has the smallest 
number of fermions (as do the coordinates of chiral superspace). (“Off shell” means 
all components less gauge degrees of freedom.) This means that the equation d?¢ = 0 
will be generalized, since it implies the Klein-Gordon equation. The simplest way to 


do this is by constructing an explicit action, our next topic. 


2. Actions 


The construction of actions in superspace is different from ordinary theories be- 
cause the geometrically simple objects, the potentials, are constrained, while the un- 
constrained objects, the prepotentials, can be awkward to work with directly. (This 
problem is magnified with extended supersymmetry, whose actions we don’t consider 
here.) 


We start with the simplest supermultiplet, the chiral superfield. Since chiral 
superfields are defined on chiral superspace, a natural generalization of a potential 


(nonderivative) term in the action to superspace is 


e= / dx 6 f(¢) + hc. 


in terms of some function (not functional) f of chiral superfields ¢ (the “superpoten- 


tial”). We can ignore any 6 dependence because it contributes only total derivatives: 
g=eMG(x,0) > f(d) =e! FO) 


Integration over @ is defined as in subsection I[A2; however, now we can replace partial 


derivatives with covariant ones, since the modification is again only by total deriva- 


puweo- face 


with an appropriate normalization. This turns out to be the most convenient one, 


tives: 


since it allows covariant manipulations of the action, and the @ integration can be per- 
formed covariantly: Since we know that the result of 0 integration gives a Lagrangian 


that depends only on x, up to total derivative terms, we can evaluate it as 


/ dx #6 f(¢) = / dx [d2f(¢)] 


2 / de [F'(6l\(26)| + f"(l) (a9) |(da9)l] 
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(suppressing indices on multiple @’s). This gives the result directly in terms of com- 
ponent fields, using the covariant method of defining the component expansion: In 


the conventions of the previous subsection, this part of the action becomes 
[ee AB + FA) pueo] 


We now consider integration over the full superspace. As a generalization of the 
above, we can write 


/ dx d*@ K(¢,¢) = / dx (d?d’K)| 


Supersymmetric versions of nonlinear ¢ models can be written in this way; here we 
consider just the case where K is quadratic, which is the one interesting for quantum 
theory. Since a function of just ¢ (or just @) will give zero in the d*0 integral, we 


choose 
K=-¢¢ => Sy) = — f de a6 6 
Explicitly, 
Ly = PE(—4¢) = —0?(P6)o = -($04)o + (10° d3b)dad — (@#6)(d6) 
+ -ALOA + b%0,°b, — BB 
where we have used the commutation relations of the covariant derivatives to push all 


d’s past d’s to hit ¢. Clearly, this term by itself reproduces the results derived in the 


previous subsection based on kinematics, so it is the desired massless kinetic term. 


We can now see the influence of adding the superpotential term to the action: 
The result of combining the two terms, and then eliminating the auxiliary field B by 


its equation of motion, is 
So+ $1 L= AOA + b%idaPby + [f(A)P + [fF (A)EY a + heel 


For example, a quadratic f gives mass to the physical scalar and spinor. This action is 
invariant under modified supersymmetry transformations, where the auxiliary fields 
are replaced by their equations of motion there also; those transformations then 
become nonlinear in the presence of interactions. Note that the scalar potential is 
positive definite; this is a consequence of supersymmetry, since it implies that the 


energy is always positive. 


Exercise IVC2.1 


These results generalize straightforwardly: 


a Find the explicit form of the component-field action for arbitrary K(¢', ¢;) 


and f(¢') for an arbitrary number of chiral superfields ¢’, including all indices. 
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b Eliminate the auxiliary fields from the action, and find the modified super- 


symmetry transformations. 


c Show by direct evaluation that the action is still invariant. 


As a notational convenience, we can drop the “|” after expanding a superspace 


action in components: For example, we can write simply 
Va = dad, B= do 


After performing the 6-integration as above by using derivatives d and d, and then 
“evaluating” these derivatives on ¢ by writing w and B, the component action is 
expressed completely in terms of such superfields and only spacetime derivatives 0, a 
This component action is independent of 6 (the Lagrangian is independent up to total 
spacetime derivatives): This is the statement of supersymmetry invariance. Thus, we 
can choose to evaluate at 6 = 0, or 6 = e€, or whatever; it is irrelevant. It is then 
understood that the relation to the usual component actions is simply to treat the 
superfield as a component field, since the 6-derivatives (in d and { d@) have been 


eliminated. From now on we will generally drop the |’s. 


The above results can also be derived from the superfield equations of motion 
by varying the action. Since @ is constrained, it can’t be varied arbitrarily; we vary 
instead the prepotential w (@ = d?w). For example, we find d?@ = 0 (and the complex 
conjugate) from the free action. Effectively, since chiral superfields are essentially 
independent of 6, not only integration is modified, but also (functional) variation. 
Since a chiral superfield is (up to a transformation) an arbitrary function on chiral 
superspace, we define 

5S{d] = yl dx #6 (6¢)°" 
dp 
for an arbitrary variation of a chiral superfield ¢, and similarly for varying ¢. In 


evaluating such variations, we make use of the identities 
fedor= fareees 
PPo=106 (P@P = 108) 


Thus, to vary a general action, it is convenient to first integrate over 6, and then vary 


in the naive way: For example, 


s=- fds ‘0 bb+ | [ax a0 f(b) +h.c. 
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Exercise [IVC2.2 
Check for this action that the component expansion of the superfield equations 


of motion agree with the variation of the corresponding component action. 


3. Covariant derivatives 


The supersymmetric generalization of nonabelian gauge theories can be derived by 
similar methods. We first write the supersymmetry covariant derivatives collectively 
as 

da = (da; da; Oaa) = EaM@Ou 


Ou = (Ou, fis Im) = 0/02, 2 = (64, 6", 2) 


Unlike the nonsupersymmetric case, the “vielbein” E4™ has @ dependence even in 


“flat” superspace, and thus the “torsion” T is nonvanishing: 
[da, dp} = Tap@do 

PHT, Vaal, rest =0 

a a B 
We now gauge-covariantize all the supersymmetry-covariant derivatives: 
Va=d4atiA, 
The covariant field strengths are then defined as 
[Va, Ve} = Tano Vo t+ iFap 


From our analysis of general representations of supersymmetry in D=4 in subsection 
IIC5, we know that the simplest supersymmetrization of Yang-Mills is to include a 
spinor with the vector, in terms of physical degrees of freedom. (The spinor and 
vector each have two physical degrees of freedom, one for each sign of the helicity.) 
Off shell, Fermi and Bose components must still balance, so there must also be an 


auxiliary scalar. From dimensional analysis, the field strengths must therefore satisfy 


Fog Psy Fab 0; Z 0,83 = —~*CapW 5, Ff 4,88 = —*CagWo 


where W,| is the physical spinor. 


The constant piece of the torsion implies stronger relations among the field 
strengths than in nonsupersymmetric theories. For super Yang-Mills we find from 
the Jacobi identity for the covariant derivatives the Bianchi identity for the field 
strengths 

VaF ec) = Tap)? F nc) 
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Specifically, the dimension-1 constraints above on the field strengths imply the dimen- 


sion-3/2 algebraic constraint that defines W,, as well as 


Fos 


nage Cap3V (aW yy +C. $V Wp) 


ap 2 
They also imply that W, is covariantly chiral and satisfies a “reality” condition, 


VaWe=0, V°Wa+ VWs =0 


The most straightforward way to derive these results is to just evaluate the Jacobi 
identities directly. We begin with a weaker set of conditions, both of dimension 1, that 
will be found (in the following subsection) to be necessary and sufficient for solving 


explicitly. One directly determines the vector derivative in terms of the spinor ones: 
Pos) & ~1Vse = {Ve Val 
Since one could always define the vector covariant derivative this way, imposing this 


condition simply eliminates redundant degrees of freedom. 


The remaining constraint (including its complex conjugate) allows coupling of 


super Yang-Mills to the chiral superfield: 


Vab=0 + 0={Va, V3} =iF 56 
It also implies the maintenance of certain free identities, such as 
VaVe = 31V as Val 7 ree Vat} = CaaV* 


(Such constraints appear also for first quantization, e.g., in superstring theory, when- 
ever a supersymmetric system is put in a background of a supersymmetric gauge field 
of higher superspin. This should not be confused with background field equations 
imposed by any gauge system put in a background of the same type: see subsection 
VIB8.) 


Thus, our minimal set of constraints can be written directly in terms of the field 


strengths as 


fp=Fg=F220 


but for our purposes it will prove more convenient to write them directly as (anti)com- 
mutators: 

1 View Var =4 Vas Vas} =, tN es Va} =-1V.3 
The solution to the dimension-3/2 Jacobis are then 


[Vie{Ve,Vy}] =O => trivial 


282 IV. MIXED 


[Vios {Vay Vi} +14, {Vos Va}]=0 = [Va Vag] = CasWs 


for some field W, simply applying the constraints to drop {Va, Vg} and replace 
{Ves Va} with V5. Similarly, we find from the dimension-2 Jacobis 


{Vas [Va Vials om [Vs LY ee Vat] =0 = VaWs =0 


[Vad {Va, V5}] + {Va,[V5, Voal} + (V4, [Va Vaal} = 0 
> [Vad V 65] — i(Cophs3+Cxafos), Joi = 5V(aWg), V°Wa +V°W . = 0 


where we separated the last equation into its (Lorentz) irreducible pieces. (The 


dimension-5/2 and 3 identities are redundant.) 


Exercise IVC3.1 
Explicitly evaluate all the remaining Jacobi identities, and show that they 


imply no further conditions on W,. 


4. Prepotential 


We saw in the previous subsection that coupling super Yang-Mills to matter gave 
directly one of the minimal constraints on the super Yang-Mills fields themselves. 
Hence, as for ordinary Yang-Mills, the definition of the gauge theory follows from 
considering the transformation of matter, and generalizing it to a local symmetry. As 
for self-dual Yang-Mills (see subsection HIC5), the vanishing of some field strengths 


implies that part of the covariant derivative is pure gauge: 
(VaVar=U = Vase" 


However, since {Va Va} # 0, this gauge transformation 2 (“prepotential”) is com- 


plex. We therefore have the covariantly chiral superfield 
Vad = 0, 


Alternatively, we could combine this exponential with that already contained in the 


free spinor derivative: 
Ve = emi alae Q = ever g. a0 = 0 


U + 22 is the analog of the covariant derivative for the Yang-Mills prepotential. This 
is a hint at supergravity: U is just the flat piece of the supergravity prepotential. 
We thus see that supersymmetry automatically gives gravity the interpretation of the 


gauge theory of translations. 
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Component expansions are now defined with Yang-Mills-covariant derivatives: 
VaP = Vas V*o=B 


VaWs = fas t+iCasD,  V?Wa=-iVa'W, 
where we have used the Bianchi identities for W, and fg (not to be confused with Fy ,) 
is the usual Yang-Mills field strength (in spinor notation). The “vector multiplet” 
thus consists of the component fields A, (the gauge field whose strength is f), Wa, 
and D (auxiliary). (As explained earlier, we drop all |’s.) 

Note that the gauge parameter is real, while the matter multiplet is (covariantly) 
chiral. The resolution of this apparent inconsistency is that solving the constraints 
introduces a new gauge invariance, just as solving the source-free half of Maxwell’s 
equations (really just constraints, not field equations) introduces the potential and 


its gauge invariance: 
; - _ wes 7 
Vi=e Ve" Vea ie = eae. g4=0 


¢' =e, b= ed = f =e¢ 
This suggests the definition of a new (“chiral”) representation, where we use the 


obvious field db and the chiral gauge parameter A replaces the real one K: Making a 


nonunitary similarity transformation, 


Vaze’Vae® => Va= ds, Ve= e "dye", eV = e@e® 


d=, G=de? => db=0, 6= (dle 

V4 _ eV ye 4 = ciAgV --iA 
Alternatively, we can also include U in the transformation as above; then U and V 
appear only in the combination U + V. 


Exercise IVC4.1 
Show that the explicit expression for the field strength W, in terms of the 


prepotential V in the chiral representation is 
W. = —id?(e~" dae” ) 


Show this expression is chiral. 


Exercise [IVC4.2 
In the Abelian case, give an explicit component expansion of the prepotential 
V, such that the vector potential A,, the physical spinor W,, and the auxiliary 
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field D appear as independent components. Note that the other components 
do not appear explicitly in component expansions when gauge-covariant ex- 
pansion (V...|) is used. The component (nonsupersymmetric) gauge where 
these components are set to vanish is the “Wess-Zumino gauge”, and is the 6 


part of the radial gauge of subsection VIB1 below. 
Exercise IVC4.3 


For some purposes (like quantization) we need the explicit form of an in- 


finitesimal gauge transformation of V. Show this can be written as 


(Hint: Consider e~’ de”, and think of 5 as an operator, as for the expansion 
of Va =e “dae”. La was defined in subsection IA3.) 


5. Gauge actions 


Generalization of actions to super Yang-Mills theory is straightforward. Matter 
coupling is achieved simply by replacing the chiral superfields of the matter multiplets 
with Yang-Mills-covariantly chiral superfields. The coupling can be seen explicitly in 


the chiral representation: In the kinetic term, 
ob = (g)leVe 


while in the { d?@ term all V-dependence drops out because of gauge invariance. (The 
superpotential is a gauge invariant function of the @’s, and the transformation to the 
chiral representation is a complex gauge transformation. The fact that the gauge 
transformation is complex is irrelevant, since the superpotential depends only on ¢ 
and not ¢.) Component expansion can be performed covariantly by replacing d’s with 
V’s in the definition of # integration: Since the Lagrangian is a gauge singlet, this is 
the same acting on it, although individual terms in the expansion differ because the 
fields are not singlets. Similarly, d? can be replaced with V’ also when performing 0 
integration for purposes of varying an action with respect to a chiral superfield. This 
is equivalent to gauge covariantizing the functional derivative (e.g., by transforming 
from a chiral representation) as 

9029) _ 5 — 2540 — 6") 

d0(2’, 0’) 
Usually we will drop the “~”’s on ¢ and ¢, when the representation is clear from the 
context by the use of explicit V’s. 
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The action for super Yang-Mills itself follows from dimensional analysis: Since 
each @ integral is really a @ derivative, d?6 integration has mass dimension +1, the 
same as a spacetime derivative. Since the Lagrangian for a physical spinor, in this 


case W%, has a single such derivative, dimensional analysis says the action must be 
SsyuM = —sir fae a6 sW°Wa 


where the (covariant) chirality of W° allows integration over chiral superspace. (Sim- 
ilar analysis applies to the matter multiplet, where { d*@ takes the place of a 1 for 


the scalar ¢.) Replacing { d?@ > V?, we evaluate the component expansion as 
Savu = &tr / dx (2f° fap + WV PW; — D?) 


Another term we can write, for superelectromagnetism (supersymmetrization of an 


Abelian gauge theory) is the “Fayet-Iliopoulos term” 


Ser=¢ [dv ov=¢ fae D 


which involves only the auxiliary field D. (The analog for the chiral scalar superfield 
is [ dx d?6 ¢.) 
Exercise IVC5.1 
Derive the supersymmetric analog of the Stiickelberg model of subsection 
IVA5, by coupling an Abelian vector multiplet to a massless chiral scalar 
multiplet using the symmetry generator T defined there. (G — —iT in trans- 


formation laws, covariant derivatives, etc., on 6, where T¢ = 1 > T?¢ = 0.) 


a To couple the gauge field it is necessary to start, as usual, with a (quadratic) 


matter action that is globally invariant under this symmetry: 
Sy = | dr a'o ¥(6- 4)" 


(At this point this is the usual, since only the cross-term survives, but this 
will not be the case for the covariantly chiral superfields.) Find the super- 
symmetric gauge coupling, and express the resulting action in terms of V and 
b. 

b Use this result to find the mass term for V in the gauge db =, 


Another interesting form of the action uses a generalization of the Chern-Simons 
form defined in the discussion of instantons in subsection IIIC6. In superspace, the 


calculation of the field strength with curved indices is modified to 


Vu = Em“Va=Ou+iAm, —i[Vu, Vv} = Fun = Ev*Ew? Fae 
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where we have left sign factors from index reordering in the last equation implicit. 
Although the curved-index expressions are not as useful (for example, for seeing which 
components vanish by constraints), we can see easily that some arguments used in 
nonsupersymmetric theories carry over to superspace. Thus, we can define the super 
Chern-Simons form by 


atr Fiuy Fea) = 501m Bnrg) 
Bune = tr($A On Ap) + iz Atm Ay Ap)) 


Converting to flat superspace (again with some implicit sign factors), 
Basco = Ex Ep’ Ec” Bunp = tr(sAtadpAcy — FAqaT acy” Ap + iz AfaABAcy) 


In terms of this expression, the super Yang-Mills action can be written simply in 


terms of the spinor-spinor-vector part B, Be of Bago as 
Ssyma = sidtr [ de d‘6 Be 


Note that the fact that the curl of B is gauge invariant implies that B transforms 
under a gauge transformation as the curl of something, and thus the integral of any 
part of B is gauge invariant (up to possible torsion terms: see the exercise below). 
Furthermore, we can drop the Fup = 0 constraint on the A in this action; it follows 


from variation with respect to Ap: One simple way to check this action is to use the 

chiral representation A, = 0: Then only the As d Ao and (A,3)? terms contribute, 

and A, = id Aa, while W, = d?Aq, so f d? integration gives — f dx d?0 W?. 
Exercise IVC5.2 


Derive the expression for Bago directly using only flat indices: 


a Start with FiapFcp) expressed in terms of T and A, and write it as a total 


derivative plus torsion terms. 


b Do the same for the gauge transformation of B. Show that the torsion terms 


do not contribute to 6B, 2. 


The multiplets and couplings we have considered are sufficient to write a super- 
symmetric generalization of the Standard Model. Unfortunately, supersymmetry pro- 
vides no unification. To get the right symmetry breaking, it turns out to be necessary 
to provide a supersymmetry multiplet for each particle of the Standard Model: The 
spin-1 gauge bosons are accompanied by spin-1/2 “gauginos” (“gluinos”, “photino”, 
“Wino”, “Zino” ), the spin-1/2 leptons by spin-0 “sleptons”, the quarks by “squarks” , 
and the spin-0 Higgs’ by spin-1/2 “Higgsinos”. Furthermore, since a reality condition 
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can’t be imposed on chiral scalar multiplets, the Higgs scalars are themselves doubled. 
Ultimately, the success of supersymmetry depends on the experimental detection of 


these particles. 


6. Breaking 


The methods of section IVA can be generalized straightforwardly to supersymmet- 
ric theories: Goldstone bosons and Higgs fields become supermultiplets, etc. However, 
to obtain realistic models supersymmetry itself must be broken, since fermions and 
bosons with similar mass and other properties are not observed in nature. More 
specifically, since gravity is observed, any supersymmetric theory of the world must 
include supergravity, and thus the breaking must be spontaneous. (Explicit breaking 
would violate gauge invariance.) Then the gravitino, which gauges supersymmetry, 
will become massive by a superhiggs mechanism, by eating a Goldstone fermion. (See 
subsections XB6-7. If the graviton and gravitino are treated as composites, then this 


fermion could also be a composite.) 


We saw in subsection IIC1 that energy is always nonnegative in supersymmetric 
theories. In particular, from the same arguments used there we see that a state can be 
invariant under supersymmetry (q|w) = q'|w) = 0) if and only if it has zero energy. 
Any such state can be identified as the vacuum, since no state has lower energy. 
This means that the only way to guarantee spontaneous supersymmetry breaking is 
to choose a theory which has no zero-energy state. (Note that energy is uniquely 
defined by the supersymmetry algebra; there is no possibility of adding a constant 
as in nonsupersymmetric theories.) In theories with extended supersymmetry, the 
relation between supersymmetry and energy applies for each supersymmetry; thus 
supersymmetry is either completely broken spontaneously or completely unbroken. 
(An exception is central charges, which modify the supersymmetry algebra: See the 


following subsection.) 


Furthermore, physical scalars appear at 6 = 0 in matter multiplets, while auxiliary 
fields appear at higher order. Since supersymmetry breaking requires 6 dependence 
in a vacuum value of a superfield, this means an auxiliary field must get a vacuum 


value. 


A simple example of spontaneous supersymmetry breaking is the O’Raifeartaigh 


model; it has the Lagrangian 


3 
i . d'6 N° G6; + / d?0 (CH, + MOP; + B62) + hic. 
i=1 
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To study symmetry breaking we ignore derivative terms, since vacuum values are 


constants. Then the scalar field equations are: 


=f Eo =O —B,+¢+ AZ = —By + mAz + 2A,Ao = —Bs + mAq = 0 


OB; 


} 
5A, = B,0;0; f =0: 2AoBo = mBs + 2AoB, + 2A,Bo = mBo = 0 
(where 0; = 0/OA; on the superpotential f(A)). Since there is no solution for B; = 0, 
supersymmetry breaking is required. In general, for superpotential f(®), the field 
equations for B = 0 are f’(A) = 0, so a linear term is always needed for supersym- 


metry breaking. 


With Abelian vector multiplets, a Fayet-Iliopoulos term f d‘@ V can also generate 


such breaking, since it also is a linear term of an auxiliary field. 


Exercise IVC6.1 
Evaluate the Lagrangian — [ d*0 $¢ for covariantly chiral ¢ by using covariant 
d-integration, [d*0 = V2V. For the case of U(1) gauge theory, add the 
action for the gauge superfield with a Fayet-Iliopoulos term, and find the 
potential for the physical scalars by eliminating the auxiliary field D by its 
field equation. 


For simplicity (as in this chapter), we may want to ignore supergravity; however, 
we still need to take account of its contribution to breaking global supersymmetry via 
the superhiggs effect. The net low-energy contribution from the supergravity fields 
(assuming no cosmological constant is generated) is to introduce effective explicit 
supersymmetry breaking: Although the original theory is locally supersymmetric, we 
neglect the supergravity fields but not their vacuum values (in particular, those of 
the auxiliary fields). In particular, if the supergravity fields are bound states, then 
this procedure is essentially the classical introduction of nonperturbative quantum 


effects. 


Thus we consider adding terms to the classical action that break supersymmetry 
explicitly. The easiest way to do this is to introduce constant superfields (“spurions” ); 
this allows us to continue to take advantage of the superspace formalism (at both 
the classical and quantum levels). Since we are neglecting (super)gravity, and in 
particular its nonrenormalizability (see chapter VII), we consider only terms that will 
preserve the quantum properties of the unbroken theories. This will clearly be the 
case if we consider only the usual terms, with some fields replaced by spurions: This 
is equivalent to using background (fixed) fields, in addition to (but in the same way 


as) the usual field variables, performing all (classical/quantum) calculations as usual, 
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and then setting the background fields (specifically, the auxiliary fields, which are 


responsible for breaking supersymmetry) to constants. 


Thus, introducing constant (in x) chiral and real spurion fields 
g=Te V=rrr 


in terms of complex and real parameters c and r, in addition to the true fields @ and 


V, we have terms of the form 
[28 (06,907.00, ow, (Parvyew,], [20 8 vie" 


(and complex conjugates). These terms can preserve the usual gauge invariances, and 
can be shown to also preserve the desirable quantum properties of supersymmetry: 
The condition is that replacing the spurion field by 1 (instead of its above value) 
gives either 0 or a conventional term (one with coupling constant of nonnegative 
mass dimension). Another way to introduce these spurions (except perhaps for the 
@V crossterm, which is less useful) is as coupling constants, rather than as fields: 
Instead of introducing new terms to the action, we generalize the old ones, so the 
constant part of each coupling is the usual coupling, while its 6-dependent terms 
produce the breaking. 

Exercise IVC6.2 


Find the component expansions of the above explicit breaking terms. What 


are the mass dimensions of the constants c and r in the various cases? 


Exercise IVC6.3 
Expand the Lagrangian 


L=- fa bb+ [eo (20° + yd) + hic. 


in components. Find the masses. 


7. Extended 


The supersymmetry we discussed earlier in this chapter, with a single spinor coor- 
dinate, is called “simple (N=1) supersymmetry”; the generalization to many spinors is 
called “extended (N>1) supersymmetry” (for N spinor coordinates). N=1 supersym- 
metric theories, at least for spins<1, are most conveniently described by superspace 
methods. (There are also some definite advantages for N=1 supergravity at the quan- 


tum level.) On the other hand, the technical difficulties of extended superspace often 
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outweigh the advantages. (The main advantage of extended superspace is proving cer- 
tain properties of the quantum theories. Of course, extended supersymmetric theories 
are complicated in any case.) Alternative formulations of extended supersymmetry 


are either 

(1) on shell, 

(2) in terms of components (ordinary spacetime, not superspace), or 
(3) in simple superspace (manifesting only one of the supersymmetries). 


By going half way, using N=1 superfields to describe extended supersymmetry, 
some of the advantages of the superspace approach can be retained. In this subsection 
we will list some of the extended supersymmetric actions for lower spins in N=1 
superspace form. These actions can be obtained by: (1) using extended superspace 
to derive the component field equations (usually using dimensional reduction: see 
subsections XC5-6), and combining components into N=1 superfields, or (2) writing 
the extra supersymmetries in N=1 superspace form, and using them to determine the 


action. 


The simplest example is N=2 supersymmetry. As for any extended supersymme- 
try, its algebra can be modified by including Abelian generators Z (with dimensions 


of mass), called “central charges”: 
{Gis 7} = 5}D 33 { Giees qa} a Cana: {d, 7} = G30" 2; [Z, q| = [Z, qd = 0 


(where i = 1,2). In terms of dimensional reduction (for N=2, from D=5 or 6; see 
subsections XC5-6), the origin of these generators can be understood as the higher- 
dimensional components of the momentum. N=2 supersymmetry is sometimes called 


“hypersymmetry”, and N=2 supermultiplets, “hypermultiplets” . 


Our first example is the free, massive N=2 scalar multiplet: Since we already 


know the field content (see subsection IIC5), it’s easy to write the free Lagrangian 


Ligm,N=2 = — if a6 b' by +4 ( / PO mT by dj + he.) 


where the index “2” is for an extra SU(2) (not the one acting on the supersymmetry 
generators), broken by the mass term, and the mass matrix m!” is symmetric while 
mit = Crm" is hermitian. In other words, it represents a 3-vector of this SU(2), 
and thus a generator of the preserved U(1) subgroup, which we have used to define 


the central charge: 
Loy = my) by 
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The other N=2 multiplet of low spin is the vector multiplet. It also has a simple 


LsyM,N=2 = air (| ow? + [eo 50) 


where ¢ is covariantly chiral and in the adjoint representation of the Yang-Mills gauge 


Lagrangian, 


group. In the Abelian case, we can also add an N=2 Fayet-Iliopoulos term, 


Lrin=2 = [eo CoV + (| dO C.o+ he.) 


where (Co, ¢4,¢_) (¢_ = ¢,*) is a constant 3-vector of the SU(2) of the N=2 super- 


symmetry: The 3 scalar auxiliary fields of this N=2 multiplet form a 3-vector of the 


SU(2). Unlike the previous example, this multiplet has all the auxiliary fields needed 
for an off-shell N=2 superspace formulation: Not only do the physical components 
balance between bosons and fermions (4 of each), but also the auxiliary ones (also 4 
of each). 


These 2 N=2 multiplets can be coupled: The scalar multiplet action is modified 


to 


Leg Wo = - | a6 db by + ; / d’6 TT bilo + M)¢;) + h.c. 


where now @, is also a representation of the Yang-Mills group (not necessarily ad- 
joint), with respect to which it is covariantly chiral. However, the same SU(2) matrix 
7 that appears in the mass matrix mij = Mt; now also appears with the N=2 


super Yang-Mills fields, 
Vadi = day +iAGGnty by, 9 =9"Gn 


where G',, are the usual Yang-Mills group generators. (Without loss of generality, we 
10 
0-1 
group, while ¢_) is the complex conjugate.) Note that the mass term appears in 


can choose 7,7’ = ( ); then ¢@4, is some arbitrary representation of the Yang-Mills 
exactly the same way as an Abelian N=2 vector multiplet that has been replaced by 
a vacuum value for its physical scalars. This can also be seen from the commutation 
relations for the N=2 super Yang-Mills covariant derivatives (see below), since the 


scalars appear in exactly the same way as the central charge. 


By our earlier helicity arguments, the only N=3 supersymmetric theory with spins 
< 1 is N=3 super Yang-Mills. The analogous statement also holds for N=4, while 
no such theories exist for N>4. Since theories with N supersymmetries are a subset 
of those with only N—1 supersymmetries, N=3 and N=4 super Yang-Mills must be 
the same: Counting states of supersymmetry representations, we see that this theory 


is the same as N=2 super Yang-Mills coupled to one N=2 scalar multiplet in the 
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adjoint representation (in direct analogy to N=2 super Yang-Mills in terms of N=1 
multiplets). In terms of N=1 multiplets, this is super Yang-Mills plus 3 adjoint scalar 
multiplets. The action then follows from the above results (without central charges 


and Fayet-Iliopoulos terms): 


LsyM.n=4 = ortr - [we Ww? - [es b'or+ (| do ze * 6 [67, OK] + he.)] 


where “J” is a U(3) index. (The U(1) part of the U(3) symmetry involves also a 
phase transformation of the 6’s.) 


For comparison, here are the general (UV well-behaved) actions for all numbers 


of supersymmetries (in D=4): 


Tiga = — ser f ao LWW, +¢ fats ve [ee geo + / 6 f(b) + he 


Ly=2 = —ztr ( / OW? + / d‘6 Voe"0) + / d*0 CoV + ( 7 dO C,o+ he.) 


- [a9 b (eM) yt +i / 26 TT bn(o + M) oy: :s he. 


bigeye = atr - [we Wes [es ee or ¢ d’6 te'/* b7[by, Ox] + he.)] 
where we now use ordinary chiral superfields, making dependence on V explicit. 


As for off-shell N=1 supersymmetry, much information on extended supersym- 
metric gauge theories can be gained by examining the properties of the covariant 
derivatives and their field strengths. In fact, this is more true in the extended case, 
where the “obvious” constraints often imply field equations (which is more than one 
would want for an off-shell formulation). The empty-space covariant derivatives are 
the direct generalization of N=1: Introducing N 6’s as 6’° (and complex conjugate 


6;*), where “i” is an N-valued index with as much as a U(N) symmetry, 


da = (dia; d's, Ong) Cia = Oie —- 140;°Ooa, d's — O's 7 150° Ona 
Re Mat y= dls), rest =0 
, , B 
Exercise IVC7.1 
Find the superspace representation of the extended supersymmetry generators 
(which anticommute with these covariant derivatives). For N=2, include the 


central charge. 
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By definition, extended super Yang-Mills has only spins 1 and less. Dimensional 


analysis then gives the unique result, including physical fields only, 

{Via, Vis} = —GiV 43 

{Via, Via} = Ceatdis 
[Via —1V gal a CiW's 
[Vads V 38] = Copifss a Cx3t fos 

(and complex conjugates of some of these equations). This corresponds directly to 
our discussion in subsection IIC5, where we saw that a general representation looked 
like antisymmetric tensors ¢, ¢', 6”,... of U(N), corresponding to helicities h, h—1/2, 
h—1,... . In this case, h=1, and these helicities come from the surviving on-shell 


components of fag,W'a, 6,..... For N=4 we have self-duality with respect to charge 


conjugation (see also subsection IIC5), 
gid = 1tihlG, 
Exercise IVC7.2 
Analyze the Bianchi identities of these covariant derivatives: 
a Show that for N>2 they imply the field equations. 
b Find a component action that yields these field equations for N=4. 


An interesting simplification of extended superspace occurs for self-duality: Con- 


straining 


Fe wie ge 0 


and dropping the self-duality condition for N=4 (so ¢;; 4 0), we find all commutators 


involving V', are trivial: 
— 7 =, _ 
1 Voy WV at = —674V 5: {V as V at = [V ao V 6] =0 
while all the remaining commutators have a similar form: 
1 Views Via} = Coatdis, [Via —1V ga = CoatW 2, [Vad V 5] = Copifiss 
The latter result suggests we combine the internal and dotted spinor indices as 
A= (4, 7%) 
so that we can combine the nontrivial equations as 


[VAsa, Vee} = tCas fas 
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The former equations then allow us to interpret the remaining covariant derivatives 
Vg as a subset of the SL(2|N) generators that rotate the A index, which form a 
subgroup of the superconformal group (S)SL(4|N). We therefore restrict ourselves to 


the chiral superspace described by the coordinates 
yAa = (2°, 6’) 


The net result is that we have a superspace with no torsion, with coordinates that 


represent half of the supersymmetries as translations and the other half as rotations. 


By comparison with our treatment of the self-dual bosonic theory in subsections 
IIIC5-7, we see that we can extend trivially all our results for the bosonic case to the 
(extended) supersymmetric case by simply extending the range of the indices. In par- 


ticular, we also have a chiral twistor superspace: Extending the range on the twistor 


A 


coordinates z“° used there so A is now an SL(4|N) index, the superconformal group 


is now manifest, and all the methods and results there (e.g., the ADHM construction) 


apply automatically to the supersymmetric case. 
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PART TWO: QUANTA 


Many important new features show up in field theory at the quantum level. Prob- 
ably the most important is “renormalizability”, which states that all the parameters 
(masses and couplings) that appear as coefficients of terms in the action must have 
nonnegative mass dimension (when the massless part of the kinetic term has no di- 
mensionful coefficient). Since the action is dimensionless, [ d*x has dimension —4, 
and the fields have positive dimension, this allows only a small number of terms for 
any given set of fields. This one condition gives relativistic quantum field theory more 


predictive power than any known alternative. 


There are many perturbation expansions that can be applied to quantum field 
theory. One is the mechanical JWKB expansion, which is an expansion in derivatives. 
Of the inherently field theoretical expansions, the simplest is to expand directly in 
fields, or equivalently, in the coupling constants. This expansion is the basis of per- 
turbative quantum field theory. However, this expansion does not preserve gauge 
invariance term by term. On the other hand, the terms in this expansion can be 
collected into small subsets that do preserve gauge invariance. There are three such 
regroupings, discussed in the four following chapters, and they are based on pertur- 


bation expansions: 
(1) the field theoretic JWKB (“loop”) expansion, 
(2) expansions in spin or helicity, and 


(3) expansions in internal symmetry (color or flavor). 
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For the most part, integrals are hard to evaluate, in particular the path integrals of 
exponentials that appear in quantum theory. The only exponentials that are generally 
easy to integrate are Gaussians, and the products of them times polynomials, which 
can in turn be evaluated as derivatives of Gaussians. Such integrals are the basis of 
perturbation theory: We keep the quadratic part of the action, but Taylor expand the 
exponential of higher-order terms. Effectively, this means that we not only expand 
in orders of h to perturb about the classical theory, but also expand in orders of the 
coupling constants to perturb about the free theory. This makes particularly useful 
our analysis of relativistic quantum mechanics (as free field theory). The JWKB 
expansion for the wave function (or S-matrix) expands the exponent in powers of h, 


dividing it onto three qualitatively different parts: 
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(1) negative powers of h (generally 1/h only), which describe the classical theory 
(they dominate the classical limit h — 0), whose physical implications have been 


considered in previous chapters; 


(2) h-independent, where almost all of the important (perturbative) quantum fea- 
tures appear (including topological ones, and quantum breaking of classical sym- 


metries); and 


(3) positive powers, which give more quantum corrections, but little new physics, 


except when summed to all orders. 


These are generally known as “trees”, “one-loop”, and “multiloop”, because of their 


graphical interpretation. 
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In the Schrédinger approach to quantum mechanics one solves a differential equa- 
tion. The Feynman approach is complementary: There one performs an integral. 
Integrals are solutions to differential equations (e.g., f’ = g > f = fg), but usually 
differential equations are easier to solve than integral equations. However, there is an 
important exception: Gaussian integrals are easy, and so are their boundary condi- 
tions. In field theory the most important approximation is one where the integrand 
is approximated as a Gaussian, and the exact integral is evaluated as a perturbation 
about that Gaussian. Of course, solving the corresponding differential equation is 
also easy, but in that case the integral is easier because it corresponds to working 
with the action, while the differential equation corresponds to working with the field 


equations. 


A major advantage of Feynman’s approach is that it allows space and time to 
be treated on an equal footing. For example, as in classical electrodynamics, we 
can solve the wave equation inside a spacetime volume in terms of conditions on the 
boundary of that volume: It is not necessary to choose the spatial boundary at infinity 
so that it can be ignored, and divide the temporal boundary into its “future” and 
“past” halves so that all conditions are “initial” ones imposed at the past boundary. 
It is not even necessary to distinguish between preparation (“if”) and measurement 
(“then”) when describing probabilities: We can instead ask the probability of a given 
wave function describing the whole boundary. This is a particular advantage for 
relativistic quantum field theory, where space and time are more closely related than 
in nonrelativistic theories. We now “review” Feynman’s approach for general quantum 
systems, and quantum mechanics in particular, so that it can be applied without 


further explanation when we come to quantum field theory. 
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1. Path integrals 


Before discussing the path integral approach to quantum mechanics, we first re- 
view some features of quantum mechanics. We can separate the fundamentals of 
quantum mechanics into “kinematics” and “dynamics”: The kinematics are every- 
thing at a fixed time — Hilbert space, preparation/measurement, probability, observ- 
ables. The dynamics are the time development. There are several ways to describe 
time dependence of matrix elements; we will start with a general framework, then 


specialize. 


Time dependence may be associated with either the states (Schrédinger picture) 
or operators (Heisenberg picture). We will be more explicit at first, taking all the time 
dependence out of the states and operators and putting it into a “time development 
operator” U(t,t’) that transforms the Hilbert space from time t’ (earlier) to time t 
(later). For example, if we want to relate an earlier state to a later one we evaluate 
(f|U(t, t’)|2); more generally, we can look at things like 


(f|--O2U (te, t1)O1U (t1, t:) |Z) 


which means to prepare an initial state |i) at time t;, then act with an operator O, 
at time t;, operator QO at time te, etc., and eventually measure the amplitude for a 
final state (f]. 


Now the dynamics can be described entirely through the properties of U. The 


general physical properties it must satisfy are 


causality (locality) : U(t3, te) U (te, t1) = U(ts, ty) 
unitarity: U(t2,t,)'U(te,t1) =I 


Causality tells us that things happen in chronological order: Each event is determined 
by those immediately preceding. It is a kind of group property; in particular, from 
considering t3 = tg we find that 

Vis 1 


We can then write 
U(t+e,t) I —icH(t) 


by expanding in ¢, for some operator H(t) that we call the Hamiltonian. Again 


applying causality, we find 


Ut ‘)—U(t,t’ 
0,U (t,t) = lim UME +6 t) ~ UE #) 
Ee € 
t)-—I 
= (um a) U(t,t’) = -iH(t)U(t, t') 
e—> € 
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which is the Schrédinger equation for U. Again applying causality to build up the 


finite U from products of the infinitesimal ones, 


tf 
U(ty, t;) =e ieH (ty €) ..e ieH (ty t Je ieH (t;) = T {exp -i dt 10 \ 
tj 


which defines the “time-ordered product” 7. Finally, unitarity, another group prop- 
erty, tells us that probability is conserved; in particular, from applying to U(t + «,t), 


H(t)' = H(t) 


The expression of U in terms of a hermitian Hamiltonian guarantees causality and 
unitarity. (It “solves” those conditions.) If H is time independent and we have a 


(orthonormal) basis of eigenstates of H, we can write 


H|I\=E;\1) => U(t,t’) = 2d (ieee 


In Feynman’s path integral approach to quantum mechanics (based on an analogy 
of Dirac), the action is the starting point for quantization. The basic idea is to begin 


with the basic quantity in quantum mechanics, the transition amplitude, and write 


(fli) = f De e's 


where { D@ is a “functional integral”: Integrate over ¢(t) for each t (with some 


it as an integral of the action 


appropriate normalization). The boundary conditions in t are defined by the choice 
of initial and final states. In this subsection we will define this integral in a more 
explicit way by breaking up the time interval into discrete points and taking the 
continuum limit; in subsection VA2 we will study ways to evaluate it using its general 


properties. 


The path integral can be derived from the usual Hamiltonian operator formal- 
ism. Considering for simplicity a single coordinate g, the wave function is given in 


coordinate space by 


wa = (ale), Ww) = f Lola 


where we use the convenient normalizations 


[Bioa=1= [Arvor [lay = ver - 4), wy = virB v) 
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for coordinate and momentum space. To describe time development, we work in 
the Heisenberg picture, where time dependence is in the operators (and thus their 
eigenstates): 


V(q,t) = (4, ty) 


Time development is then given completely by the “propagator” or “Green function” 
_ dq; 
Glay, ty; Gis ti) = (aps tylai ti) = (s,ts) = | = Gay, trig ti) d(G, ti) 
V2 


Exercise VA1.1 
Let’s review the relationship between time development in the Heisenberg and 


Schrodinger pictures. Using the usual relation 


WIQGId = YMHQx() 


between the time-independent states |q) and time-dependent operators Q(t) 
of the Heisenberg picture and the time-dependent states |(t)) and time- 
independent operators Q of the Schrodinger picture, define time-dependent 


eigenstates in two ways: 


(a(t) |(t)) = (al) 
v(q,t) = (alv(@)) = (a, tly) 


Given the time development of a state 


Qla)=ala) = { 


Y(t) = UY) 


(U(t) = U(t,0)), find the development of Q(t), |q(t)), and |q,t), and show in 
particular that |q(t)) 4 |g, t). Which is the eigenstate of Q(t)? 
In general, even for time-dependent Hamiltonians, we can find the infinitesimal 


time development explicitly from the definition of the time derivative and the time- 


dependent Schrodinger equation: 
[20, — H(—70,, ¢,t)](¢,t| = 0 
> (atte = Giél{l — icH[P(),Q(), t]} © (¢, te APOeO4 


and similarly for (p,t + e€| (where P and Q are the Hilbert-space operators). To 
derive the path-integral formalism, we then iterate this result to obtain finite time 
development by inserting unity infinitely many times, alternating between coordinate 


and momentum, 


dpo dq dp, 


telai, ts) = ree 
(ag tglai, te) Dn oe (ap, te 
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.-|pi, t + 3) (pr, t) + 3e|qn, ty + 2€) (qr, ti + 2€|Do, ti + €) (Do, ti + €lgi, ti) 
to obtain successive infinitesimal exponentials, 


dpo dq dp, —te —tE —te —tE —tE 
Tonia ae Bey) (pile |g) (ale |po) (pole |) 


where the time dependence follows from the previous equation. However, note that 
all the implicit time dependence of the Heisenberg picture drops out, because we 


extracted the e~**”’ 


s, putting all the factors of each matrix element at the same 
time: Although each matrix element is evaluated at time € earlier than the one to its 


immediate left, each is of the form 
(a,t + |b, t) = (a, tle *#FPO.COAD 4) = (ale #245) 


(where |b) = |b, t;), etc.), leaving only any explicit time dependence that may appear 
in the Hamiltonian, effectively translating the other t’s — t;. Then we only need to 


know 
(qlp) =e'?", —— (plq) =e "4 
to evaluate the matrix elements in the path integral as 


dpo dq, dp, 


exp{—iqipo + €H (po, %, ti) — MP0 + €H (po, a, ti +6) + mpi t+ €H (pi, mu, ti + 2€) + ...]} 


More explicitly, this result is 


- dp ddn 
,tslai,t:) = | Dp Dg e~*, Dp Da = 
(aystrlaste) = f Dp Da saad A Be 8 Ob 
N-1 
S= S {= (ns — Qn)Pn i €|H (pn, Qn; ty + 2ne) ar Fins Qn+1; ti = (2n ae 1)e)]} 
n=0 


do=G, In = Of) tp —t, = 2Ne 
Note that by adding (or subtracting) a step or two we could just as well evaluate 
(qf, t lpi, ti) or (pe, telai, ti) or (py, ty|pi, ti). 
The classical picture is a segmented path, with the particle traveling along a 
straight line segment from point q, to point g,4; with momentum p,: Each q is 
associated with a point, while each p is associated with the line segment connecting 


two consecutive points. In the “continuum” limit « — 0, N — on, ty — t; fixed, 


tf 
tj 
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(We have dropped some terms in (q|H|p) and (p|H|q) from reordering the operators 
Q and P in H(P,Q) to apply P|p) = p|p) and Q|q) = q\q). These commutator terms 
alternate in sign, combining to give terms of order e?, and can be dropped in the 


continuum limit.) 


More generally, we can evaluate an arbitrary transition amplitude as 


dq; dq; = 
A= (fli) = | Fo Bova lay telat val) = / Dp Dg by*(ape™ ila) 
where now 
N-1 N 
Apso ai, 
Dp Dq= II Pp gq 


V2 ei V2 


Note that we can combine the initial and final wave function, as 


n=0 


Via, gs) =Vs* (aq vila) += A= [ Dq Y(qi,af)e” 


The complex conjugation of =, vs. Y; is due to the complex conjugation involved in 
time reversal (as seen, e.g., when comparing an eigenstate of p at the inital time to 
the same eigenstate at the final time). In field theory, where the “p’s and q’s” are 
functions of space as well as time, if we choose the boundary in space also to be finite, 
so that the space and time boundaries form a single connected and closed boundary, 


then W is simply a function of the q’s over all that boundary. 


We now see the relationship of the path integral approach to the time development 
operator: From the above derivation of the path integral, by integrating back out the 
insertions of unity immediately after extracting the infinitesimal exponentials and 


translating the time of each matrix element to zero, we find 


(aps telai, ta) = (aU (Ey, ti) |Q:) 


U(ty, ti) a ieH(tp—e) | |, e ieH (t; +€) 6 teH (ti) _ {exp -i dt H() \ 
tj 


as previously. This is effectively a Schrédinger-picture expression (all the P’s and Q’s 
are at the initial time), and can also be derived in that picture by solving for the time 


dependence of any state |y(t)). 
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2. Semiclassical expansion 


The path integral formulation is especially suited for semiclassical approxima- 
tions: The Bohr-Sommerfeld quantization rule follows from the fact that the func- 
tional integral is invariant under S — S + 27n, since S appears only as e~*’; in that 
sense the action is more like an angle than a single-valued function. The JWKB ex- 
pansion follows from S' — S/h and expanding in h. This expansion can be interpreted 
as an expansion in (space and time) derivatives, since it leads in the usual way to the 
identification p = —ihO/Ox and EF = iho /dt. 

Exercise VA2.1 

For comparison, we review the Schrodinger equation approach. Consider the 
nonrelativistic JWKB expansion for the propagator (for an arbitrary Hamil- 


tonian /7) to the first two orders in h, writing it as 
Ger pore 


a Show the corresponding orders in the time-dependent Schrodinger equation 
at t > 0 can be written as the classical equation of motion for the action S 
and the (probability) current conservation law for the (probability) density p 
(“Hamilton-Jacobi equations” ), 


H=S, (ose) +p=0 


Oq' Pap; 
when the argument p of H is evaluated at 
_ Os 


(Assume a symmetric ordering of p’s and q’s in the quantum H.) Compare 
the relativistic case examined in exercise IIJA4.1. 


b The propagator is expressed in terms of g and qo, where G(q, qo, t) ~ 6(q— 4) 
at t = 0, so the first order in h is found by using the solution to the Hamilton- 
Jacobi equations to write the classical action in terms of the “final” position 
q and initial position gq. (In principle; in general even the classical equations 
may be too difficult to solve analytically.) However, the Hamiltonian is given 
as a function of p and q. Show that the change in variables from q,p to q, qo 


gives : ° 

OH eS i) ors 
a=, Geo — 

OD; ( ) OqOt (Mis Og,Oq@ 


Show that 
p = det (—i+M) 
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(the “van Vleck determinant” ) solves the current conservation law, using the 
explicit expression for (det M)M~! given in subsection IB3. Check the nor- 
malization, using the initial condition for propagators (or comparing to the 


free case). 


One way to apply the path integral is as follows: (1) Find a classical solution to 
the equations of motion. This gives the leading contribution in h (“stationary phase 


approximation” ), 
[ etS/h ~ e7t50/h 


(The validity of such an approximation with an imaginary exponent will be discussed 
in subsection VA5.) 


(2) Expand about the classical solution as 
? = ba + Vi Ad 
Expanding in A¢@ (or h), we have schematically 


AS -_ ech ots hs! Abt 194 (Ad)? he Al? si Ag)? ate 


6699 


where “)” means to evaluate at @ = dy and the derivatives are really functional 
derivatives (so there is also an integral for each derivative). The first term in the action 
gives the classical contribution, while the linear term vanishes by the equations of 
motion. The quadratic term gives an h-independent contribution to the exponential, 
so the next order approximation to the functional integral comes from integrating 


just that: Integrating Gaussians as in subsection IB3, 
[v6 e t5/P wx @150/F (det Se 


where the determinant is now a functional one, which can be defined by performing 
the functional integral as in the previous section, as a series of ordinary Gaussian 
integrals. The boundary conditions are Ad = 0 at t; and ty (since ¢, = ¢ there). 
Normalization constants can be determined by comparing the free case, or considering 
the limit where the initial and final times converge. 

(3) We then expand the exponential in the cubic and higher terms (positive powers 
of h): The resulting functional integral is that of an h-independent Gaussian times a 
polynomial with positive powers of fh. Since odd orders in A@ vanish by symmetry 


(Ag — —A¢), only integer powers of h appear: 


[%° eiS/h = ein f (Age W487 (: a ae) 
n=1 


A. GENERAL 305 


Polynomials times Gaussians are also straightforward to integrate: The easiest way 
is to first evaluate integrals of Gaussians with linear terms: 


as al §x/2+j? x -1/2,j7S~ 15/2 
amyBR° = (det S)~"/“e 


from shifting the integration variables (x — x + S~+j, etc.) to eliminate the linear 
terms, then using the previous results. In functions of x multiplying the Gaussian, x 
can be replaced with 0/07 (and similarly for z) and then pulled outside the integral. 
(If a linear term is not included, it can be introduced, and the result can be evaluated 
at 7 =0.) The final result then takes the form 


(Qf, trl ti) = 50/2 A +hB+ sa) ee (- > 5) 
n=0 
Exercise VA2.2 
Generalize the above results for integration of Gaussians with linear terms to 


the cases with fermionic and mixed (subsection IIC3) integration variables. 


Exercise VA2.3 


Evaluate 


—z* Sx/2,4 ter 
= Soa 1a valet 


by taking (0/0);)(0/09;)(O/Ojx)(O/Oj1) on the above result. 

As an example, consider the free nonrelativistic particle. The separability of the 
action translates into factorization of the functional integral, so the result can be 
found from the one-dimensional case. As usual, 
wpm @ 


L=-jmi? => era(t)=a%+ 
ty —t; 


(é — &) 


where we have written the classical solution in terms of the variables appropriate to 
the initial and final states, namely x; for an initial state localized there at time ¢;, 
and «x f,ty for the final state. Since the classical action is itself quadratic, so is its 
expansion: 
2 
Lf — Ly . 
S=5)9+ AS, So= ma AS= - fat sm(Az)? 
fou 

In general, a determinant from the AS integral must be evaluated (but see exercise 
VA2.1). In this simple case, time translation invariance, dimensional analysis, and 


independence from x, x; are enough to determine the result of that functional integral 
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up to a constant, fixed by the short-time limit ty — t;. The final one-dimensional 
result is then 


Tem eim(xs—ai)?/2(ty—ti) 


(rp, ts|Z;, tj) = 


where we have used 


(one way of defining a Dirac 6 function) to normalize 
(Lp tltgt) = V 200 (ae — a) 
The Gaussian integral for the free particle can also be performed explicitly, by 


using the discretized Hamiltonian path integral of the previous subsection. 


Exercise VA2.4 
The path integral for the free, nonrelativistic particle can be evaluated much 


more easily using the Hamiltonian form of the action. First consider the 


fore) 
/ dx cipe—ex?/2 
—oo 


as a special case of the Gaussians already evaluated, and use it to derive the 


Gaussian integral 


identity 2 
/ dx e'?® = 2nd(p) 


(The € thus acts as a regulator to make the integral well defined.) Then use 
the discretized expression of subsection VA1, and evaluate the x integrals 
first. All but one of the p integrals then can be trivially evaluated, the last 


giving a Fourier transform. 


Exercise VA2.5 
Consider the one-dimensional harmonic oscillator. (The multi-dimensional 


case is again separable.) 
a Explicitly evaluate the discretized path integral to find the result 


—imuw ae + 2?)cos wt — x2; 
x94 —_—_*+____—_ 


t, + tle;.t;) = 
(mga ts + tla, ts) sin wt sin wt 


b Rederive the result using the result of exercise VA2.1. (Hint: First solve the 
classical equations of motion for x(t), then rewrite it in terms of x; = qo and 
xf = q; plug into So = S and apply the above.) 

Note that we have been sloppy about the definition of the “integration measure’ : 


In going from the Hamiltonian form of the action to the Lagrangian form, we ignored 
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some m dependence. Specifically, if we start with the Hamiltonian form, as derived 
in the previous subsection, and derive the Lagrangian form by integrating out p, we 
find the 1/m in H = p?/2m leads to 


N-1 N-1 
ie dpn 1 dL», MNP Ut. 
n=0 n=1 Y 2m 


The m—-/? then cancels similar factors from the N — 1 x-integrals, while the re- 


maining \/m is that found in the final result above. 


If we had considered a more general Hamiltonian, as in subsection IITA1, where 
p* appeared as 594 (x) piD;, then we would have obtained a measure of the form (for 
Cy ey dD) 

-1/4 nee ae, 
[det g(xo)det g(xy)| II (2n)D/?, det g(an) 

(We have averaged g as g(a)p? > \/g(@n)g(@n+1)p2, since x, is associated with the 
point n while p,, is associated with the link from n ton+1.) Such measure factors are 
easy to recognize, since they are always local, without any derivatives: If we included 


it in the action, it would be a term proportional to 
In | | det a(e,)= 1\ve In det g(x) ~ 4(0) [a In det g(x(t)) 


(The factors at zo and xy are for standard normalization of the wave functions, which 
we can absorb by a redefinition.) In practice we just drop all such factors throughout 
the calculation, and fix the normalization at the end of the calculation. Since the 
Lagrangian form follows from the Hamiltonian form, which was properly normalized, 
we know such factors will cancel anyway. Auxiliary fields can require similar factors 
for proper normalization; then such factors are simply the Jacobians from the field 


redefinitions from a form where they appeared with trivial quadratic terms. 


3. Propagators 


The amplitude we defined by path integration in subsection VA1 is the “propa- 


gator” or “Green function” for the Schrodinger equation. Explicitly, we define 
G(q, t,t) = at —')(g, tq, #+) 


where we have included the “step function” @(t — t’) (1 for t > t’, 0 otherwise) to 
enforce that the final time is later than the initial time (retarded propagator). This 


satisfies the free case of the general defining equation of the propagator 


[O; =F 1H (-i0,, q, t)|G(q, t; qd’, t’) = [—Oy = tH (tidy, qd, t)|G(q, t; Gs i) 
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= V2n6(q — 7/)5(t — t’) 


where we have used 
d,0(t — ’) = 6(t —t’) 


and the facts that G without the 6 factor is a homogeneous solution of the Schrodinger 
equation (no 6’s) and becomes a 6 in x for small times. The propagator then gives a 


general solution of the Schrodinger equation as 


(q,t| = {= aid.tyi¢t| = vat )- | he (aid, tv”, tv) 


In particular, for v(q, t’) = V276(q — q’') at some time t’ for some point q’, w(q,t) = 


G(q,t;q,t’) at all later times. These equations are matrix elements of the correspond- 


ing operator equations; e.g., 
G(q,t:7,t) = (Ut, #)|q/) 


(A, +H (OIU(e,¢) = UE, #) [Oy + 4H (E)] = Ot — LI 


where we now include a step function in the definition of the time development oper- 


ston . 
U(t,t’) = a(t —t)T {exp i [ dt (0) \ 


This solution for the propagator is not unique; as usual, a first-order differential 
equation needs one boundary condition. Another way to say it is that the inhomoge- 
neous differential equation is arbitrary up to a solution of the homogeneous equation. 
We have eliminated the ambiguity by requiring that the propagator be retarded, as 
incorporated in the factor 6(t — t’); using instead —0(t’ — t) would give the advanced 


propagator. 


This has an interesting translation in terms of the Fourier transform with respect 
to the time, which replaces the so-called “time-dependent” Schrodinger equation with 
the “time-independent” one. Fourier transforms are a useful way to solve differen- 
tial equations when performed with respect to variables with translational invari- 
ance, since this implies conservation of the conjugate variable: The result is elimi- 
nation of the corresponding derivatives. In this case, it means the time-independent 


Schrodinger equation needs a time-independent Hamiltonian. For example, defining 


dt dt’ 
V Jon V/ Jon 


=> -i(E—H)U(E,E')=5(E-E')I 


U(E, E’) = ee a. t') 
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~ Q 


=> UE L')= B 


File - E’) 


Now inverse Fourier transforming, 


dE dE’ 
V2 27 


= [yer y 
20 E-—H 


we have an ambiguity in integrating E past the pole at E = H. We therefore shift 


U(t,t’) = eH) (EB) 


the pole slightly off the real axis, so we can integrate exactly on the real axis. Closing 
the contour by adding to the real axis a semicircle of infinite radius in either the 
complex upper- or lower-half-plane, wherever convergent (lim)... e|Fl — 0, but 
lim4+00 €* #4! = 00), we find 
dE _, i 
ett 


— tL 4 (44) e-tHt 
i B= Lie ave 


which gives either the retarded or advanced propagator depending on the choice of 
sign for the infinitesimal constant € (retarded for E—H+ie). Remember from exercise 
ITA1.2 that complex integration is essentially just Gauss’ law, with poles acting as 


charges: The general integral result we used is 


where the counterclockwise contour of integration encloses the pole at z but no sin- 


gularity in f, so we can evaluate the integral by Taylor expanding f about z. 


t<0 «4 |E 
yo ie 


N 


/ _ 
! e A+i€ \ 
: *H-ie / 
AN / 
NC oe 
~<«- 
t>0 


To perform the inverse Fourier transform, we note that the exponent needs an 
infinitesimal negative part to make the integral convergent: 
_ 4 
 E-Htie 


/ dt 'F*(4)0(4t)e HFA 


310 V. QUANTIZATION 


Exercise VA3.1 


Show that 
i i 


rte x-t 


= 276(2) 
by three methods: 
a Use the above result for the Fourier transform. 


b Show that this is the contour integral definition of the 6 function, which is 
actually a distribution, by integration, multiplying by an arbitrary (nonsin- 
gular) function and integrating along the real axis. (Hint: Push the poles 
onto the real axis, shifting the contours along with them, to find the integral 


of a single function along the difference of two contours.) 


c Prove the identity (checking the normalization) 


lim 
e0 x? + €2 


= 276(z) 
For the example of the free particle in one dimension we found by various methods 


G(a, t: x’, t') = A(t = t') a ei(x—2')? /2(t—-t') 


However, we could have saved the trouble if we just started in momentum space, 
A —i(t—t! —i(t—t!)p? /2m 
G(p, typ, t') = (pIUE,¢) |p’) = (plO(t — te 4 Ip!) = OE — te PP (pip!) 


= O(t _ t') /276(p _ pie rien 


in the retarded case. If we Fourier transform p to x (the same as a change of basis 
from |p) to |x)), the integrals are then simple Gaussians. Again, the result is simpler 
in p-space because p is conserved. In the relativistic case we will want to treat energy 


and momentum equally; doing the same here for later comparison, we define 
~ dq dt 


W(p, E) = 1 Ya AC t) 


and similarly for G, and we have 


~ a 
G(p, E;p’, E’) = Pon 2nd(p — p')O(E — EB’) 


4. S-matrices 


“Scattering” is defined as a process that starts with a free state and ends with 
a free state, with interaction (self- or with external forces) at intermediate times, 


e.g., particles coming in from and going out to spatial infinity and scattering from a 


A. GENERAL 311 


potential of finite spatial extent. Thus, if the interaction is nonvanishing somewhere 


between times t; and tz, where ts > tg > t; > t;, we can write 
UGah) =UGA DUG Uh Ser Uae Oe 


in terms of the “free term” Hp of the Hamiltonian H = Hj) + V, where V is the 
“interaction term”. (V may be time dependent, but not Hy.) It is more convenient 
to work with a quantity that is independent of initial and final times (as long as they 
are outside of the interaction region t, to ty). We therefore define the “S(cattering)- 
matrix” operator S as 


S= lim e*toU(t,, te 
t;——0o 
tp—too 


where we have thrown in the limit because in the real world interaction doesn’t just 


start and stop, but fades in and out. However, in our simple example above we find 
S= aT (ty, ine 


In the special case of a free theory (V = 0), we have simply S = J. 


In the interacting case, the amplitude we get from the path integral is the inter- 
acting propagator. To be able to take the limit describing time development between 
infinite initial and final times, we need to choose boundary conditions such that the 
initial and final basis states have the time dependence of free particles, described 
by Ho, assuming that the particle behaves freely at such asymptotically large times. 
This is called the “interaction picture”, to distinguish from the Heisenberg picture, 
where the states have no time dependence, and the Schrodinger picture, where the 


states have the complete interacting time dependence. We thus evaluate the limiting 


amplitude 
; dq dq 
A= lim (by(tp)|di(ti)) = lim, Ta ORI as tar, ty, te) sdb) 


tf—too tf—too 


for the interaction-picture states |(t)), relating the interaction-picture coordinate 


basis o(q,t| to the Heisenberg-picture basis (q,t| (with initial conditions 9(q,0| = 
(q,0| = (al): 


te = (OR trade) = oar tsletsU (ty, tie qi, ti) o 


V(t) = ol, tld) = A= (w,|S|¥i) 


with S as defined above. 
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The fact that time development conserves probability (H = HT") is reflected in 


the corresponding unitarity condition for the S-matrix: 
SIS =1 


A more complicated condition is causality: The basic idea is that interactions take 
place in chronological order. (A stronger statement of causality will be found in the 
relativistic case: that any interaction should take place at a spacetime point, rather 
than just at a single time. It follows from this weaker one in relativistic theories, 
since event B is later than event A in every Lorentz frame only when B is in A’s 
lightcone.) Causality is the condition that the Hamiltonian at any time involves only 
variables evaluated at that time. (H(t) is a function of only ¢(t), all at the same 
time t, where ¢ = p,q are the quantum variables appearing in the Hamiltonian.) A 
nice way to describe the interactions is by introducing a classical background as we 
did for the semiclassical expansion of path integrals, such as by ¢(t) — ¢(t) + x(t), 
where y is just some function. The important point is that we have shifted ¢(t) by 
x(t) at the same t, so as not to disturb causality. We then consider the effect on the 
S-matrix of modifying the background y by a function dy localized (nonvanishing) at 
some particular time t, and a function dy’ localized at t’, such that t > t/. Picking 


out the dx pieces in the time-ordered product, we can therefore write 
Slx + 5x + 6x'] = UF, HVOUEL) VEU, d) 


Sly + dy] = U(f, DVHU(E, LU, 2) 
Sly + 6x] = U(F, HUE E)VE)U(E, i) 
S{x] = U(f, UC )U(t, 7) 


where U(t’,7) is the time-development operator from time ¢; to time t’ (including the 
canceling factor with Ho), V(t’) is the extra factor in the time development at time 
t’ resulting from the function 6x’ localized there, etc. Thus we replace a VY with the 


identity if the corresponding dx is absent. Then we easily find 
Slx + 6x + dx'] = S[x + dxJS"[xIS[x + 6x'] 
=> (S'[x + dx]S[x + 4x + 6x/] — 1D) — (S"[xIS[x + 6x] - 1) = 0 
= aa (stu Shi) =0. fori S72 


using the infinitesimal functions 6x and 6,’ to define functional derivatives (as in 
subsection IITA1 for the action). 
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In general, it is not possible to solve the Schrodinger equation for the propagator 
or the S-matrix exactly. One approximation scheme is the perturbation expansion in 


orders of the interaction: 


H= Hyp Vv = T(e =F ERY —i(t— sone gf dt e~ L(t p— Hol iV (t)Je —i(t—t;)Ho 
ti 


tr t rer 7 
a. i at f dt! etter) Hol _iV(t)Jett-* )Ho [—iV (t')]e“ —t;)Ho ae 
ty ty 


=> Spi = (f|S|t) = (ft) + +f dt (f,t|[-«V (€)]]é, ¢) 
+f af dt! (f, t\[-iV (Je) [iv (#')] 3, t') + 


(To get this result, look at the definition of the time-ordered product in terms of 
infinitesimal integrals.) The first term in S is just the identity (i.e., the free piece). All 
the other terms consist of a string of interactions (—iV ) connected by free propagators 
(e~40, where t is the time between the interactions), with each interaction integrated 
over all time (subject to time-ordering of the interactions), and the initial/final state 


(wave function) evaluated at the initial/final interaction time. 


Exercise VA4.1 
Assume the initial and final states are eigenstates of the free Hamiltonian: 


H|t) = Ej), Ao|f) = E;\f) 


Assuming V has no explicit time dependence, explicitly evaluate the time 
integrals in the S-matrix, effectively Fourier transforming from time to energy, 
to find 


Spi = (ft) — 20d (Es — E,)(f(E — Ho) a F — Ho) |*) |=n, 


(Hints: Redefine the integration variables to be the times between interac- 


tions. Taylor expand 1/(£ — H + ie) in V for comparison.) 


In field theory we want to express any state in terms of a basis of products of 
1-particle states, so we can calculate the behavior of these specified particles. We 
try to do this by using field variables (the “q’s” of field theory): Each field operator 
should produce a single particle. Unfortunately, this is not the case: An asymptotic 
state of given 3-momentum created by such a field operator is not necessarily an 
eigenstate of the energy, because such a state can be either 1-particle or n-particle, 


due to interactions. The propagator for the field is then of the form 


Gp, tpt) ~ 6p — p') Ser ybr(p)e OO 
I 
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NT nYI 
Ex(p) =)_ Erie), Dopi=p 
j=1 7=1 


where “E7,;(p;)” is the energy of a 1-particle state (the 5), will include an integral in 
general). However, as long as all particles have masses, such an asymptotic 1-particle 
state is distinguishable as that of lowest energy Ep: The higher-energy states are 
n-particle states to which this particle can couple. (If some of the n-particle states 
were lower energy, the 1-particle state could decay into them, and thus the 1-particle 
state would be unstable, and not asymptotic. With massless particles things are more 
complicated: Then 1-particle states are more difficult to define and to measure.) In 
principle, we could define the 1-particle states by constructing the corresponding 
operator, consisting of the field plus terms higher order in the fields; in practice, this 
is rather complicated. (Note: For the above analysis, it might be convenient to use 


the center-of-mass frame.) 


A simpler way to make the asymptotic states unambiguous is by modifying the 
definition of the S-matrix: 


_; f'f 
S= lim eMsbo7 (« ae . ") eae 


ti>—00(1+ie) 

t p+00(1+ie) 
introducing factors of 1 + ie for some positive €, which may be chosen small for 
convenience. (Actually, we can generally replace 1 + ie with just 7 if it is not too 
confusing: The result is the same.) The effect is seen by considering a matrix element 
of particular fields that may be a superposition of different energies FE in the initial 
state and E’ in the final state, but evaluated between an initial state of energy EF; and 
a final state of energy Ey (which might not be equal for a time-dependent interaction, 
e.g., if the number of particles changes). Since E > £; initially and E’ > Ey finally, 


the time dependence of any such matrix element is proportional to 


1 for £=5,E = E} 


Shi lim cits (E’—Ej) lim ec #(E-Ei) — 
) 0 otherwise 


t¢—+o0(1+ie) ti; —00(1+%e 


Alternatively, we can simply impose E = E;, E’ = Ey directly in the definition: 


; at! ae 
S = lim On (ty), HT (< I, ") Si(t),Hoe ° 


tfp—too 


where the free Schrodinger equation Hp = E; or Ey defines FE; for the initial state 
and EF for the final state, and 04,4, is evaluated by examining the asymptotic time- 
dependence of the time-development operator with respect to ¢; and tr: Normally 


field theory is calculated in energy-momentum space, working with the spacetime 


A. GENERAL 315 


Fourier transform of the above, where this amounts to simply comparing energies 
f= bf > Ep. 

If we know some details of the interaction, this modification may be irrelevant: In 
particular, in local quantum field theory interactions happen at a point in space and 
time. For example, consider the inner product between a 1-particle state in its rest 
frame and a related n-particle state, which appears in the same propagator. Because 
of locality, the wave function for the n-particle state, when evaluated in position space 
(which is where the theory is local) is simply the product of n 1-particle wave functions 
evaluated at the same point. But we know that for small relative momenta (where a 


nonrelativistic approximation holds) that the individual wave functions propagate as 
lw] ~ fe- ep P VP 


from the form of the free 1-particle propagator. (Or, we can use dimensional analysis, 
and consider the spread of a particle of restricted range of momenta from a confined 
region: Then ||? ~ 1/V and the volume V ~ |t — ¢/|?~1.) This implies that the 
n-particle wave function will fall off as the nth power of that, so in the limit of large 
times the 1-particle state will dominate. In a relativistic theory the length scale 
associated with this fall-off will be associated with the masses involved, and thus at 


a subatomic scale. 


5. Wick rotation 


In the previous subsection we ensured convergence in the definition of the S-matrix 


by effectively making the “coordinate change” 
t > (1—‘e)t = e*t 
in the definition of the limit 
(l-ie)t 09 => t—(1+ie)oo 
This affected the time-development operator as 


—iHt _, ,—iHt—et 


e€ e€ 


for H > 0 to pick out the ground state H = 0. The same effective substitution was 
made in subsection VA3 in defining the contour integral for the propagator: 


/ GE -maige_* [GE ime 
Om E—H On (l+i)E—H 
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dE ine y dE in y 
20 E — (1—ie) 20 E-H+te 


which is the same as the substitution 
E = (1+ie)E = e*E 


(when working with the time-independent Schrédinger equation) since essentially 
E =i0/0t. 

In general, having to do contour integrals and keep track of ie’s in propagators is 
inconvenient. Fortunately, there is a simple way in practical calculations to get rid 
of not only the ie’s but (almost) all the other 7’s as well. The method is known as 
“Wick rotation”. The basic idea is to extend the above complex rotation from angle 
€ to angle 7/2: 

is =tpSe C$" EsiE 


pushing the contour even farther away from the singularities. Thus, the Schrodinger 
equation is changed to a “diffusion equation” (to describe, e.g., Brownian motion): 


((-H)yb=0 > (+H)b=0 


For example, for the free particle the resulting equation has no i’s. The time- 


independent Schrodinger equation then becomes 
(E-H)b=0 => (FE -H)W=0 


The result for the propagator is then 


“dE ig 1 —Ht 
|, eee: 
[. a «aap 


Now no 2e prescription is needed, since the pole was moved away from the real axis. 


Similar remarks apply to the inverse Fourier transform 


i dt eae = = 


co 


1 
H=ik 


Exercise VA5.1 
Find the Wick-rotated retarded propagator G(a’,t’;z,t) for the free (1D) 


particle, satisfying 


(+ H)G = (—dy + H')G = V27r6(a — 2')5(t — t') 
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Furthermore, if we define the S-matrix directly in this Wick-rotated space 


aft 
S= lim oT (¢ ie ") e tio 


ty — 00 


tf—too 
then the limiting procedure is unambiguous even in field theory, since 


B-By) _ ‘6 for E=E; 


lim et 
0 forE>E,; 


tj—7—00 
1 for K=Ey 
0 for E> Ey 


tf—+oo 


lim @-9-*7) = { 


Another important effect is on actions. For example, in the mechanics path 


integral for a particle with kinetic term T = 3mz? in a potential U(x), we integrated 


oes SS paw —T) 
Upon Wick rotation, this becomes 
es s= fau+n 


The major change on the exponent —S is that it is now not only real, but negative 
definite. (For physical purposes, we assume the potential has a lower bound, which 
can be defined to be nonnegative without loss of generality.) Thus, the semiclassical 
approximation we made earlier, called the “stationary phase” approximation, has now 


-5/h to a Gaussian, 


become the “steepest descent” approximation, namely fitting e 
which is approximating the integral by the places where the integrand is largest. We 


thus write 
S(a) = S(ao) + 3(@ — 2)?" (xo) + «.., S'(t%p) =0, S" (x9) > 0 


for one variable, with the obvious generalization to many variables. Explictly, we 


then have 


dr .-s@/n x, 1 _,-8(0)/n 


Vonh Sa) 


plus higher orders in h, expressed in terms of higher derivatives of S. In the case 


S'(x)=0 


of many variables, S” is replaced with a determinant, as for the Gaussian integrals 
of subsection IB3, and for functional integrals, with a functional determinant. (But 
sometimes the functional determinant can be replaced with an ordinary determinant: 
See exercise VA2.4.) 
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So now we can first calculate everything in Wick-rotated spacetime, where every- 
thing is real (more precisely, classical reality properties are preserved quantum me- 
chanically), and then Wick rotate back to find the correct result in physical spacetime. 
In particular, the appropriate e¢’s, still needed to correctly position the singularities 


in physical spacetime, can be restored by rotating back through an angle 4m —€: 


inverse Wick : to (G+etHiP-, ES (-iteE =e Ff 
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Classically we distinguish between particles and waves. This can be consistent 
with a classical limit of a quantum theory if there is a conserved charge associated 
with the classical particles, with respect to which the classical waves are neutral. Such 


a situation is described by a field theory Lagrangian (density) of the form 


L = Y'O()v + Lo(9) 


where w is the field of the charged particle, and ¢ the field of the waves that carry 
the interaction. O includes the kinetic operator; a nonrelativistic example was given 
in subsection HIA3. Thus O(¢)w = 0, the field equation for ~, is also a Schrédinger 
equation, which we can derive from a classical mechanics action. (A zero-range inter- 
action, as in billiard-ball collisions, is described by an Ly without derivatives.) Then 
we can have continuous worldlines for the particles: The statement that the worldlines 
do not end or split is associated with charge conservation. The interaction between 
the particles and waves is described by @ dependence in the particle (mechanics) ac- 
tion obtained from O (and not the term Ly for the wave fields). If we look at just 
the mechanics action, the modification is the same as considering external fields (like 
external potentials in nonrelativistic mechanics), since we are ignoring Ly, which is 


needed for the field equations of ¢. 


Ly then can be added separately. Coupling to such external fields is a simple 
way to study properties of particles without applying field theory. For example, in 
nonrelativistic mechanics it helps to explain charge and spin, which don’t appear 


explicitly in the free Schrodinger equation. 


1. Particles 


All the information in quantum mechanics is contained in the propagator, which 
gives the general solution to the Schrodinger equation, and can be obtained by the 
Feynman path integral. Here we discuss the free propagator for the spinless particle 
(whose classical description was given in section IIIB), which is the starting point for 


relativistic perturbation theory. 


We consider quantization first in the Lorentz covariant gauge v = 1. From sub- 


section IIB2 we have 


T 
Sani | dt[|—2""Dm + 1(p? + m2)] 
0 
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Except for the T integration in the functional integral (in addition to the functional 
integration over x and p), the same methods can be applied as in the nonrelativistic 


case, where we had : 
f ; 
SHNR = / dt(—4'p; + =p”) 
ti 


The simplest expression (and ultimately the most useful one) is obtained by Fourier 
transforming with respect to x: In comparison to the multidimensional nonrelativistic 
result 

Galp'.tp",t) = 6(p — pat — te” 
(where here 6(p — p’) = (22)"P-)/26-1 (pt — pl") for D — 1 spatial dimensions), the 
relativistic result is 


G(p, p’) = fl dT 5(p — p')0(T)e wm )/ 


(where now 6(p — p’) = (21)?/?6?(p* — p!*) for D spacetime dimensions). 
There are several simple yet important differences from the nonrelativistic case: 


(1) The dependence on the mass m is different. In particular, we can set m = 0 only 


in the relativistic case. 


(2) There is an additional integration [ dT’, because the variable T, which is the 
remaining part of v, survives the gauge v = 1. (It is all that remains of a 
would-be functional integral over v.) This is analogous to the time integral in 
the nonrelativistic case for G(p', E;p", E’), if we set the energy to zero. This 
is as expected, since the relativistic classical mechanics differs from the nonrel- 
ativistic one mainly by constraining the “Hamiltonian” 5(p +m?) to vanish. 
This interpretation also leads to the “zero-energy” version of the inhomogeneous 


(proper-)time-independent Schrédinger equation for this case, 
—i,(O — m’)G(z, 2’) = 6(x — 2’) 


(3) The propagator is automatically “retarded” in the “proper time” 7’, as a conse- 
quence of the positivity condition v > 0, which was motivated by the geometrical 


interpretation of v as the worldline metric. 
When used in this manner to write the propagator in terms of a Gaussian, T is known 
as a “Schwinger parameter” . 


Generally, it is convenient to remove the momentum 6-function (which resulted 


from translational invariance) as 


G(z,2')= A(x—2') => G(p,p') = 6(p— p')A(p) 
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AGy= / dT (Tem)? 


where we have simply written A(p) for the Fourier transform of A(x) (dropping the 
tilde). Performing the T integral, using the same methods as for the t integral in the 
nonrelativistic case, we have the final result 

—1 


MP) = TERT md ie) 


Actually, this result is almost obvious from solving the relativistic wave equation. The 
only part that is not obvious is the “ze prescription”: how to perform the contour 
integration upon Fourier transformation. In the nonrelativistic case, we saw two 
obvious choices, corresponding to retarded or advanced propagators; the classical 
action did not distinguish between the two, although the retarded propagator has the 
obvious convenience of determining later events from earlier ones. On the other hand, 
in the relativistic case the choice of propagator was fixed from classical considerations. 


T is restricted to be positive, and the ie is needed to make the T' integral converge. 


Exercise VB1.1 
Take the nonrelativistic limit of the relativistic propagator, and compare with 
the propagator of nonrelativistic quantum mechanics. Explain the difference 


in terms of the nonrelativistic limit of the classical mechanics action. 


Exercise VB1.2 
Perform the analysis of exercise VA2.4 for the relativistic particle. First re- 
place the integration over T by a sum: Instead of dividing up the time into 
2N intervals of length € and taking the limit N — oo,¢ — 0, with 2Ne fixed, 
sum 2€ )7y_», and then take the limit « — 0. (2Ne is now T instead of tp —ti, 
and we integrate over it instead of keeping it fixed.) Perform all x integrals 
and then all but the last p integral before summing over N. Again, the entire 
calculation is much easier than using the Lagrangian (second-order) form of 


the path integral. 


To understand this point better, we examine the Fourier transformation with 


respect to time. In contrast to the nonrelativistic case, there are now two poles, at 


p=tw, w= V/(pi)?+m?: 4-2 (a - ee ) 
DP 


Ww w—te) p+ (w-—ie) 


where now p* = (p°, p’). These are also the two classical values of the canonical energy 


(as opposed to the true energy, which is the absolute value), which we saw previously 
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corresponded to particles and antiparticles. With our prescription for integrating 


around the poles, using the same methods as in the nonrelativistic case, we then find 
Ket os ; 1 _. ' 
G(p', t; p", ') = (Qn)? 5? “(p' = per | 
Ww 


a! Lae 
= (2nyPP5P*(pi — pl) —[O(t — He WO) + O(t — tee) 


We now see that the particles (p° = w) have a retarded propagator, while the antipar- 
ticles (p° = —w) have an advanced propagator. This is the quantum version of the 
classical result we saw earlier, that particles travel forward in time, while antiparticles 
travel backward. The interpretation is simple: When evaluating matrix elements of 
the form (f|O|i), the resulting propagator ensures that the initial wave function con- 
tains only positive energies, while the final complex conjugate wave function contains 


only negative energies (i.e., the final wave function itself contains positive energies). 


We next compare quantization in the lightcone gauge. Again from subsection 

ITIB2, : 
Susc= [ drlé-p* - 2p! + Mp? +m?) 
Whereas in the covariant ees the analog to the nonrelativistic time t was the 
“proper time” 7’, the analog is now the lightcone “time” +. Since tT = a*/pt, we 
have E = p-pt (EF = 10/07 vs. p- = 10/0x*), and thus 
i —i 
20) = Bp em) tie Teme) 

as before. Note that this derivation was almost identical to the nonrelativistic one: 
Unlike the covariant gauge, we did not have to add in T as a separate variable of 
integration (but not path integration). However, this Schwinger parameter is useful 
for evaluating momentum integrals and analyzing momentum dependence. This is a 
typical characteristic of unitary gauges: They are more useful for keeping track of 


degrees of freedom. 


2. Properties 


As in electrodynamics, the free scalar satisfies a differential equation second-order 
in time, so the propagator is used differently from nonrelativistic quantum mechanics 
to give a general solution to the wave equation. We begin by considering a free 
“action” between two different scalar fields, written in a way where all derivatives act 
on just one field or just the other, i-e., where the field equation is explicit. The two 


forms are related by integration by parts, but now we keep boundary terms: 


pers [A(O — m?)B — B( — m?)A] = per @- (AB) = parton Ad mB 
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where in the last step we have used the (generalized) Stokes’ theorem (see subsection 
IC2); “$d?-'o”™” is the integral over the closed surface bounding the volume inte- 
grated over in f d?x. In practice we take the volume to encompass all spacetime in 
the limit, neglect the part of the boundary at spacelike infinity, and choose the parts 
of the boundary at timelike infinity to be surfaces at constant time, so the boundary 


integrals are over just space: 
parton Ad mB = pars Ad, BI, = pars Ad: Blo — pars Ad;,B\ v0 


We then have the solution for the wave function inside the volume in terms of that 


on the boundary: 


(O-—m?)p=0, —i4(O - m’)G(z, 2’) = —i4(O! — m?)G(a, 2’) = 6(ax — 2’) 


where the wave equation for is e Klein-Gordon equation. 


Exercise VB2.1 
For a free nonrelativistic particle, solve x(t)’s ” 1D wave equation” for a Green 
function that vanishes at t; and ty (“Dirichlet” boundary conditions). Use it 
to find the solution for x(t) in terms of t;, tp, 2;, and wf as given in subsection 
VA2. (Don’t forget the sign from the orientation of the “boundary”, i.e., 
P= 4% or tp.) 


Similarly, this defines a conserved current from any two wave functions 
A+ (W1* Oye) = o1*(O — m7) — Yo(D — m7) q* = 0 


or, evaluating the integral over a volume infinite in space but infinitesimal in time, 


the conserved charge 


d 
dt 


This leads to the covariant inner product ( || ) on a hypersurface (as opposed to the 


ds b1* Oe = =0 


naive inner product ( | ) for the full space) 
q?- ly 
(il) =e") f oo 


(27) PP wy" Lidyby 


where the e(p°) appears because the contour integral gives a + at later times (positive 
energy) and a — at earlier times (negative energy). Explicitly, we find for the inner 


product of plane waves 


bp(x) = (x|p) = e'?* 
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(2n)P/2-15P-1(p' — p*)e(p°) 4 (p° + p®) 


We have used p* + m? = p® + m? = 0, which also implies that |p°| = |p|: Thus, 


=> (p\|p') = 


the inner product vanishes if the waves have opposite-sign energy, while for the same 


sign €(p°)$(p° + p”) = |p®|. The result then can be written manifestly covariantly as 


O(p — p’) 


ed ae 3 eee 0,70 z)P/2-1gP-1 (pi 
Ind[(p? + m?) = O(pip” )w(2n)i 0 (p 


(p||p’) = =p) 

Similarly, the solution for the wave function in terms of the Green function gives 
only positive-energy contributions from the part of the surface at earlier times, and 
only negative-energy contributions from the part of the surface at later times. More 
general on-shell wave functions, since they depend on only D — 1 spatial momenta 


and the sign of the energy, can be written as a restricted Fourier transform 


w(e) = f dp 2x5 (40? + m’)]e”"G0) 


> (12) = f dp 2n5[3(p? + m?)jda(9)* dal) 


(Here 7)(p)* means to complex conjugate after Fourier transforming to p-space; oth- 


erwise, we need to change the sign of the argument.) In particular, for a plane wave 


d(p—p') 
276[5(p? +m?)] 


we have 


Up (p) = 


It will prove useful later to have a collection of solutions to the homogeneous and 
inhomogeneous Klein-Gordon equations, and compare them in 4-momentum space 
and time-3-momentum space. Using the previous nonrelativistic and relativistic re- 


sults, we find 


A: 


—iw|t| 


—i/(p? + m? — ie) => oije™ fais t)e™ = £€ 
A*: i/(p? +m? + ie) => He" 40(je“ = elt 
Ar: —i/(p?+m?—iep®) => O(t)e* —O(t)e™ = —2i0(t)sin(wt) 
Aa: i/(p?t+m*+iep®?) = 6(-te?-O(-t)e™? = 2i0(—-t)sin(wt) 
A,: O(p)276(p2+m?) => $= O(t)e"** +6(-the™? = et 
A_: 6(—p°)276(p? +m?) = O(t)e* +6(—-t)e = ~ 


where we have omitted certain common factors (see subsection VB1). Ax satisfy 


the homogeneous equation, while the rest satisfy the inhomogeneous one. (These are 
easily checked in the mixed space, where the Klein-Gordon operator is —(0? + w).) 
This table makes explicit which sign of the energy propagates in which time direction, 


as well as the linear relations between the momentum-space expressions. In particular, 
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we see that A, propagates just the positive-energy states, while A_ propagates just 


the negative-energy ones. 


Exercise VB2.2 
The relativistic propagator uses a particular choice for integrating around the 
two poles in the complex energy plane, as encoded in the ze prescription. If 
we ignored the classical determination of that prescription, there would be 


four simple choices, integrating either above or below the two poles. 


a Show these four choices can be enforced by replacing ie in p? + m? — ie with 
ie, —ie, iep®, —iep® 
and derive the results of the table above. 


b Give explicit expressions for the four propagators in position space in four 


dimensions for the massless case. 


We can check the propagator’s behavior by explicit evaluation, using plane waves: 


) f LOE ale — 2), p(x") = p46" + 19) De HN vet 
e(p QnppR x — 2')510¢'bp(a') = e(p’) a(p + 10) —e e 


= O[p*(t — t')vp(2) 
where we have used the previous result for G(p,t;p',t’) (and thus A(j,t)). Again 
we see that the propagator propagates positive-energy solutions forward in time and 


negative-energy backward. 


This propagator also applies to relativistic field theory. (See subsection HIA3 for 
nonrelativistic field theory.) In comparison to the nonrelativistic case, the propagator 
lt > 


is now —i/5(p? +m?) instead of —i/(;+p? — E), and this determines the kinetic term 


in the field theory action: 
y= - [ac 5¢3(O —m?)¢ = ju 5[(0¢)? + m?¢7] 


To make the functional integral of e~’°° converge, we replace m? — m? — ie, which 
is the same ie prescription found in first-quantization. Note that we have used a real 
field ¢* = ¢. (A complex field can be used by doubling = (¢, + i¢2)/V2.) This is 
possible only in the relativistic case because we have both positive-energy solutions 
et as well as negative ones et’”', (In other words, the relativistic Schrodinger 
equation is a second-order differential equation, so we get two i’s to make the kinetic 
operator real.) Reality simply means identifying particles with antiparticles. (F.g., 


there is no “antiphoton” distinct from the photon.) 


326 V. QUANTIZATION 


3. Generalizations 


More generally, we will find propagators of the form (in momentum space) 


i 
A=-—, K=Kk' 
K 


’ 


corresponding to free actions 
5 = / dx 46K 


where K = —i(0 — m7) in the case just considered. (We have neglected the ie in A, 
which destroys its naive antihermiticity.) Then the inner product is defined as above 


in terms of the Green function by again using integration by parts, 


i / d?x [(KA)'B — A'KB] = €(p°) ¢ dP—15™ A'M,,B 


to define the operator M,,, which was <(p°) Sim in the previous case. (For the usual 
equal-time hypersurfaces, we use My = —M°. There may be additional implicit 
matrix factors in the Lorentz-invariant inner product A'B.) This in turn defines the 
inner product er 
(12) =f oe il Mat 
and thus D-igim 
V0) =o") § Gore Clea) Moe) 

This inner product gives a nonnegative norm on physical bosonic states, but on physi- 
cal fermionic states it is negative for negative energy, because ordering the initial state 
to the left of the final state (the wrong ordering for quantum mechanics) produces 
a minus sign from the anticommutativity of the fermions. (From the explicit inte- 
gral, this appears because K is generally second-order in derivatives for bosons, but 


first-order for fermions, so M,,, has one factor of p® for bosons and none for fermions.) 


For physical fields, the (free) field equation will always imply the Klein-Gordon 
equation (after gauge fixing for gauge fields). Thus, the propagator can always be 
written as 

A=-5= “ipo 

5(p? + m2 — ie) 
in terms of some matrix kinematic factor N(p). Using this expression for the propa- 
gator in our above position-space inner product, this implies (e.g., using G as in the 


previous subsection for the denominator by using a Fourier transform) 


dp(z) = d(p)e”* == N(p)Mo(p)¥(p) = wh (p) 
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from integrating over a hypersurface at constant time. If we generalize to other flat 


hypersurfaces with timelike normals n™ (e.g., by just Lorentz transforming), we have 


A A 


N(p)n + M(p)(p) = —€(p°)n - pd\(p) 


and finally, by taking linear combinations for different n’s, 


N(p)Mn(p)b(p) = —€(p°) Pmt (Dp) 


If we choose a basis that is orthonormalized with respect to all quantum numbers 


other than momenta (i.e., with repsect to spin/helicity and internal symmetry), we 
i DSP) 
" 2n6[5(p? + m?)] 


If we ignore coordinate/momentum dependence and focus on just these other quantum 


have 


(p, i||p', J) = 6 


numbers, then it is clear that 
N = [e(p°)* S > |i) (i 


where we have included an extra sign factor for negative energy and half-integer spin 
from the reordering of states, as explained above. In an arbitrary basis, we can 


generalize to 


N(p) = [e(p)?? So dt (p) dil) 
The positive-energy propagator is then given by a sum over all positive-energy states: 
A, = N(p)0(p°)276[3(p* + m?)] 


The fact that K is not simply the Klein-Gordon operator is a consequence of un- 
physical (gauge/auxiliary) degrees of freedom appearing in the action: Then N is a 
projection operator that projects out the auxiliary degrees of freedom on shell, and 
the gauge degrees of freedom on and off (in unitary gauges), as represented above 
by a sum over physical states. However, more general N’s are sometimes used that 
include unphysical degrees of freedom; these must be canceled by “ghosts”, similar 


unphysical degrees of freedom of the opposite statistics. 


Exercise VB3.1 
Demonstrate all these properties for spin (helicity) 4 (see subsections ITA6, 
IIB6-7, and HIA4): Find M,, from integration by parts. Find N both from 
inverting to get the propagator and from summing over physical states. Show 
the NM wb identity is satisfied. 
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a For the massless case, use twistors for the solution to the field equation to 
find 


ap)” = —8253€(P°), — (N) 3 (P) = €(P°)Pabg = Pop 


b Do the same for the massive Dirac spinor, to find 
Mn = “Yel Yi; N=p+ Va 


(Hint: Consider the rest frames for p? > 0 and < 0.) 

Exercise VB3.2 
Use the construction of exercise VB1.2 to define the path integral for the 
spinning particle of exercise IIIB1.3. Show that in the covariant gauge v = 1, 


A’ = constant, the propagator can be written as 


_piy2_ien. —1y:p a 
Aig x fe dT O(T)e PIP HEP eae 
- | (Ye 
up to some arbitrary normalization factor, where € = [ dr is the only gauge 


invariant part of X (as T = f dr v is for v). 


We will find that quantum corrections modify the form of the propagator. In 
particular, it may modify the position m? of the pole in —p? and its residue, as well 
as adding terms that are analytic near that pole. For example, consider a scalar 


propagator of the form 


N 
A a 
(P) ‘Tp +m? — ie) = 


where JN is a constant and R is analytic in p. By the procedure of “renormalization” , 


can be set to its original value (see chapter VII). Alter- 


N can be set to 1, and m 
natively, we can cancel N in the normalization of external states, and redefine the 


masses of these states to coincide with what appears in the propagator. 


Exercise VB3.3 
Use this propagator to define the inner product between two plane waves, 
and evaluate it explicitly. Show that R gives no contribution, and the plane 
waves need factors of VN to maintain their normalization. (Hint: What is 
the wave equation corresponding to A, and how is it related to M,,? You can 


also consider the relation of N to A+.) 
Away from the pole, at higher values of —p? than m?, there will also be cuts cor- 


responding to mutiparticle states. Although these higher-energy intermediate states 


in the propagator will contribute to the time development even for on-shell states 
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(those satisfying p? + m? = 0 asymptotically), in S-matrix elements we can ignore 
such contributions on external lines, using our modified definition for evaluating the 


asymptotic limit for the S-matrix (see subsection VA4). 


Exercise VB3.4 


Consider the general scalar propagator 


=-1 * foo = —m —2m)o 
At) =i fan oe —,al) = 5m) + O(n — 2m) 


which contains a pole at mass m and contributions from multiparticle states 
at mass 2m and higher. Fourier transform from energy to time. Use this 
propagator to define the time development of a momentum eigenstate satis- 
fying the free wave equation asymptotically, using the 1 + ze prescription of 
subsection VA4 to define the asymptotic limit: 


u(t)= lim / (t,t) Lids, to(ts) 


t;——00(1+%e) 


and show that o(j) does not contribute: 


(O-m)%=0 > v(t) =dol(t) 


4. Wick rotation 


As in the nonrelativistic case, the ie prescription can also be fixed by the infinites- 
imal Wick rotation (see subsection VA5) 


1 1 


However, in the relativistic case, a finite Wick rotation gets rid of not only 2’s but 
also the annoying minus signs associated with the Minkowski metric. We now replace 


all timelike coordinates, including proper time, with spacelike coordinates: 
[=> =a, 7 + 17 
In addition, for every vector field V* we replace 
ves -iV® (Vi V') 


and similarly for tensor fields. (Here we have defined Wick rotation in the first- 


quantized sense: on all explicit coordinates and momenta, as well as on explicit 
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Lorentz indices. For example, in the field theory action we rotate the explicit deriva- 


tives and the integration measure, rather than the arguments of the fields.) 


Note that FE as defined in the nonrelativistic case was —po, which is the same as 


p° only in Minkowski space: 
p’— —ip’, po > +%po 


Furthermore, there is some apparent ambiguity in how to change the integration 
measure, corresponding to how the integration contours are rotated (i.e., changes 
in the limits of integration). In particular, we see from subsection VA5 that the 
contour rotation for EF is actually in the opposite direction of that for t, consistent 
with Fourier transformation. (Effectively, we keep the extra i for [{ dp from rotating 
po, while dropping the —1 from pp < p° because of the usual absolute value in the 
Jacobian in real changes of variables.) The net result is the naive change for [ dz, 


while that for [ dp preserves the inverse Fourier transform: 
pu = -i f ae, fw - if a, HAs. soy4—20) 


When manipulating explicit expressions, the factors of i on coordinates/momenta 
and fields can be transferred to the constant tensors contracting their indices: The 
net effect is that the Wick rotation is equivalent to changing just 7, the integration 


measures, the flat-space metric, and the Levi-Civita tensor: 


T wT 2 1 @ 
a ai a 
: OT OT 
farsi far farsi fae, [ni far 
mn Onis €abed — =1eaheii 


So now the inner product is positive definite: We have gone from Minkowski space 


to Euclidean space. 
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For example, for the relativistic particle in the gauge v = 1, the propagator is 


now 
1 


= —T(p?+m?)/2 __ 
AG = fe De ROM = 
The integral is automatically convergent because p*+m? is now positive definite. If we 
examine our transformation of timelike components in terms of the complex p® plane, 
we see that we have just rotated the contour from the real axis to the imaginary axis, 
through an angle 7/2, which avoids the poles at p? = +(w — ie). (There is actually 
a slight cheat in the massless case, since the two poles converge near vanishing 3- 
momentum, where w = 0, but this problem can be avoided by an appropriate limiting 
procedure.) Note that in the relativistic case the Euclidean propagator is completely 
real also in momentum space; even the overall —i has been killed. (Compare the 
nonrelativistic case in subsection VA5, where —i/(H—E) — 1/(H-—iE).) This follows 
from first Wick rotating in position space, then performing the Fourier transform as 


usual (avoiding an extra —i from rotating the [ dt in the Fourier transform). 


Exercise VB4.1 
Find the propagator in time-3-momentum space, G(p', t: p",t'), after Wick 
rotation. (Le., Wick rotate A(p) first, then Fourier transform.) 
Exercise VB4.2 
Find the massless propagator in 4D Minkowski coordinate space, including 
the ie, by 
a Fourier transforming the Schwinger-parametrized momentum-space propaga- 


tor in Minkowski space (including the “Minkowski” rT), 


b doing the same entirely in Euclidean space, and then Wick rotating the time 


back to Minkowski space, and 


c Fourier transforming both of the above cases without using the Schwinger 
parameter, first doing the energy integrals as in the previous section. (Hint: 
Use rotational invariance to point # in a particular direction to simplify the 


angular integration.) 


Exercise VB4.3 
Although the propagator in momentum space is most useful for scatter- 
ing of plane waves, its position-space dependence is more useful for bound 


states/scattering of localized sources: 


a Evaluate the Wick-rotated propagator in arbitrary dimensions D for large 
e=as/a* by 


(1) Fourier transforming the Schwinger-prametrized form, 
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(2) performing the (Gaussian) p integration before the T, and 


(3) using the steepest descent approximation on the exponential to approxi- 
mate the T integral (but don’t bother sticking the power of T multiplying 


the exponential in it as a log). 


Why does this approximation correspond to large x? (It may be useful to 
make the redefinition T — (a/m)r.) You should find the result 


A(x) = V2rmO-9)/2¢g-OV/2—-me 


After Wick rotating back, the exponential becomes a phase inside the light- 
cone, while outside it gives exponential damping, as in quantum mechanical 
barrier penetration. (In the massless case, there is damping away from the 
lightcone both inside and outside, but only by powers, determined by dimen- 


sional analysis. ) 
b Show the above result is exact in D=1 and 3 by 
(1) Fourier transforming without the Schwinger parameter, 


(2) performing a contour integral over the magnitude of p by closing it ap- 
propriately and picking up the contributions at the poles at p = +im, 


and 


(3) for D=3, fixing x” in a particular direction and doing the angular part of 


the p integration. 


(This method can also be used to obtain the exact corrections to the above in 
terms of elementary functions for higher odd D.) For the physical case of the 
potential produced by a static point source in 3 spatial dimensions, we find 
A(x) = V2me~™* /x, so at short range the Coulomb potential is unmodified, 


while it is exponentially damped at range 1/m. 


c Check the results for b by taking the massless limit, and comparing to the 
analogous result using the method of a, but doing the T’ integral exactly 
for those cases. (Warning: The result is infinite in D=1, and some type of 
“regularization” must be used to subtract an infinite constant, leaving a finite 


x-dependent remainder.) 


The corresponding effect on the action, where we path-integrated 
ee : S= fer s(um? — vt E" Mn) 
is to integrate the Wick rotated expression 


a a fe s(um? + 0 88" Sinn) 
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This method also applies to relativistic field theory. Wick rotation t — —7t of the 


kinetic term 
eS = f de Hn (Ond)(Ond) + md" 
now gives 


eS y= f de ¥(5"(Ond)(Ond) + m6" 


This has an interesting consequence in the complete action (including interac- 
tions). Positivity of the energy implied the non-time-derivative terms in the action 
had to be positive, but the time-derivative terms are now the same sign in the Wick- 
rotated action, effectively the same as adding an extra spatial dimension: The energy 
T + U is the same as the Wick-rotated Lagrangian (-T. + U — T+U). So we can 
replace the condition of positivity of the energy with positivity of the Wick-rotated 
action, which we need anyway for path-integral quantization. (Note that for the 


particle this again requires v > 0.) 


Although Wick rotation is thus very useful for application to “intermediate” 
states, it can never be applied to physical (i.e., initial and final) states: For ex- 
ample, p? + m? = 0 has no solution in Euclidean space, since each term is strictly 
positive. Typically, this means that one performs quantum calculations first in Eu- 
clidean space, then Wick rotates back to Minkowski space before applying physical 


state conditions. 


Unfortunately, the simple results for Wick rotation obtained here for spin 0, 
although they generalize to spin 1, do not work so simply for other spins. (Consider, 
e.g., trying it in spinor notation.) However, the method can still be applied to the 
coordinates, and simplifies momentum integrals, since one can avoid contours, 7’s, 


and Minkowski minus signs. 
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des Riciaapacneae saints Che Tia soc cestseceseriecsciaases 


A relativisitic quantum field theory is defined by three properties: 


Poincaré invariance (“relativistic”) is the basic result of special relativity. Its con- 
sequences have been accurately observed both macroscopically and (sub)micro- 


scopically, and no violations are known. 


Unitarity (“quantum”) is the main mathematical result of quantum mechanics: 
Any quantum theory can be considered as the corrections to the classical theory 
implied by unitarity. This is one way to define perturbation theory, and is equiv- 
alent to the usual (JWKB) expansion in fh. (The other major axioms of quantum 
mechanics are concerned with the physical interpretation of the quantities calcu- 
lated, such as the preparation and measurement of states.) Quantum mechanics 


also has been accurately verified, with no observed violations. 


Causality (“field theory”) appears in many areas of physics, formulated in many 
ways. The strongest way to state causality, in a way independent of special 
relativity and quantum mechanics, is as locality: All interactions happen at a 
point; there is no action at a distance. This means that any force applied by an 
object at one point in spacetime on another elsewhere(/when) must be mediated 
by yet another object that carries the effect of that force between the two. The 
most accurate verifications of this principle have been through the predictions of 
relativistic quantum field theory. Note that “locality” is what defines spacetime: 
For example, in quantum mechanics x and p are treated on an equal footing. 
Usually p is defined as the generator of a symmetry, but this definition can be 
obscured in a translationally noninvariant potential or in curved spaces (like a 
particle on a sphere, or in general relativity). But the fact that interactions are 
local in x (and are no more than quadratic in p) tells us that events occur at a 


point that can sensibly be interpreted as a “position”. 


We now define the perturbation expansion of the S-matrix, and give its general 


properties. 


le 


Path integrals 


Path integrals for relativistic quantum field theory in four dimensions are better 


in every way than canonical quantization. They are 


(1) 
(2) 


easier to learn and apply: just Gaussian integrals. 


more heuristic: no “Dirac sea” or harmonic oscillators (where are the springs’). 
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(3) less mathematical: no operators, so no time-ordered products (much less “T*” 


ones), Wick contraction, normal ordering, etc. 
(4) more efficient: Functionals make combinatoric factors automatic. 


(5) more rigorous: Constructive quantum field theory has proven the existence of 
certain relativistic quantum field theories in less than four dimensions (working 


in Euclidean space). 


(6) more covariant: The action and Feynman rules are, so the middle steps should 
be. 


As we have seen in simpler examples (subsection VA2), perturbation for path 
integrals is based on Taylor expansion of the exponential of the “interaction” (part 
of the exponent higher than quadratic in the functional integration variables). Since 
general transition amplitudes involve also wave functions, we also Taylor expand 
them: In field theory, this is an expansion in the number of particles. We therefore 


write 
do™...dor 
-> af Tom)NDI oe (21)...0(tn)Mim...Mnn wn (21, 5 2N) 


where we have used the covariant inner product of subsection VB3. (The surfaces of 


integration are at t = oo.) We have also used the free N-particle wave function wy, 
Kywn SS gay Kywyn = 0 


which is sufficient to describe particles at t = too. Amplitudes for such asymptotic 
states are elements of the S-matrix, as defined in the interaction picture (see subsec- 
tion VA4). In practice we choose a particular value of N, and use a basis element for 


wn, namely the product of N 1-particle wave functions: 


N N 
Wn (21, --,2n) = [[ eni(ev) + permutations => Wd) = [ [lle 
i=1 i=1 


In principle, if there are bound states in the theory, we can consider similar wave 
functions, but besides ¢@ we expand in the composite field describing the bound state. 
It should be possible to discover such states by looking at the properties of the 
amplitudes of the ¢ states. (For example, a two-particle bound state would show up 


in the amplitude describing the scattering of two particles.) 


Note that the fields ¢ are real (or we sum over both ¢ and ¢*), while the wave 
functions (W or w) are complex: As usual in quantum mechanics, we work in a 


complex Hilbert space, but often expand over a real basis. For example, w(q) = 
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(0) +0'(0)g +4" (0)q? +... or (o) = w1 (§) +¥ (7). Since in a covariant approach 
we treat particles and antiparticles on an equal footing, Y[¢] should include both the 
initial and final wave function: Thus, the factors in W at t = —oo can be interpreted 
as the usual positive-energy multiparticle wave functions, while the factors at t = +oo 
can be interpreted as the complex conjugate of the usual positive-energy multiparticle 
wave functions. Since they each have one sign of the energy, they are necessarily 
complex. However, for a real field ¢ we can have only (¢|w), while for a complex 
field @ we distinguish (¢|w) as representing a particle at t = —oo or an antiparticle 
at t = +00, and (¢*|W) as an antiparticle at t = —oo or a particle at t = +oo. (As 
usual, which we choose to call particle and which antiparticle is relative, because of 
CPT invariance.) 


We therefore want to evaluate the path integral 
A= [ D6 e96 


(Because the wave functions have free time dependence, the transition amplitude A is 
in this case an S-matrix amplitude S.) Separating out the free and interacting pieces 


of the (gauge-fixed) action, 
S= 5+ S1= f de soK 6+ Sy 


and using the integration identity 


du 
V2 


at v = 0, we can evaluate the path integral as 
o=0 


A = exp (=i fae EE) ec S1lPly[g] 
n) 


. 1 d 10 —_ an! é _ al 


We have dropped the determinant factor, since in our case it will be only an overall 


—uMu du —uMu/2_u —1 
e M 2 tlu+v) = = z M [26 Ov F(v) ~w edeM Ou/2 #(y) 


constant coming from the kinetic operator AK. We then absorbed this proportionality 
constant into the definition of Dd, as we did for the free particle in subsection VA2. 


In this case, this normalization is fixed by the “free” part of the S-matrix. 


It will prove convenient to distinguish the ends of propagators that attach to S; 


from those that attach to W, so using the identity 


fOr)g(aat+y) =f, +Qy)g@y) @=2, y=y+2) 


C. S-MATRIX 337 


evaluated at x = y = 0, we rewrite this expression as 


515 516 ,616\_, 
— ap pe De | Bale 
a cop ( ‘f igriet exist tes) ety) _ 


We then evaluate the derivatives that act on only one or only the other: 


A 


o=0 


a) ag fd16)\.2 
vial (+f sexae) 


Z|] (the “generating functional” for the S-matrix) contains all propagators with 


p=0 


S7[@]’s attached at both ends, and forms the basis of the perturbation expansion. 


From the integration identity above, we can also write it as 
Ziel = / Dé e7iSolel+S1lo+e)) 


Effectively, we have just taken the functional integral [ D@ e~*° [| and separated the 
field into a “quantum field” ¢ (the integration variable) and a “background field” y, 
where y includes the asymptotic states, which propagate to infinity, while ¢ vanishes 
at infinity (or at least goes to a constant) fast enough to allow the usual integration 
by parts in performing the functional integral. Thus y gives the boundary value of 
the field. This is essentially the same as the general prescription for path integrals 
given in subsection VA2, except that we take y to be arbitrary for convenience of 
functional differentiation, and we drop free y terms, which were incorporated into W 
(i.e., we expand just $7). 

vo] is the result of contracting some pairs of the one-particle wave functions 
with propagators. This gives the usual inner product of those one-particle states: 
Since for any one-particle wave function satisfying the free field equations we have 


the propagator identity 


i _80"_(0) Mn Ale — 2°) = Ue) 
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such contractions give 


da™da' / / / / 
[rvomae — £)M,v (2) 


m 
=f apyore HC2) Mn! (x) = (wll) = (WB) 

where the inner product vanishes unless ~ and y’ have opposite energies (i.e., one is 

incoming and one is outgoing). This is the boring part of the S-matrix element: It 

represents the corresponding particles not interacting at all. (For example, in the free 

case S; = 0 we have Z = 1, and A= W|4—0 consists of only such inner products.) For 

most purposes we just factor out such free inner products, and consider only processes 


where all particles interact. 


Finally, the conversion from W to W replaces all the on-shell inner products with 


integrals over all spacetime: Using the propagator identity from above, 


ales t) ”) 
| = af Baron a 55a) Tolen) 


where w Nn appears in W exactly as dy in WV: 


GO ac 
-y3 N! / ead co ON Wn (x1, +++) €N) Mim. Myn0(21).--(en) 


Usually we represent : n as the product of single-particle wave functions, 


ow [S]=I (fom) = v= T(J omg) 2 


in which case W simply replaces each field @ in Z with one of these wave functions. 


As Sat Sr 
A-— 1: ( )S7 


(Il) dOe 
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Thus, of the three types of propagators, only the ones that connected two factors 
of S; remain, all inside Z; the ones that connect the wave functions to Z have been 
replaced with just spacetime integrals, while those connecting the wave functions to 


each other have become the usual spatial integrals for the Hilbert-space inner product. 


Exercise VC1.1 


Show that the amplitude can also be written as 


A= 0 Zyoiel| 


dNDe 6 ) 
ae a hf aagRBR YN ORES Feta) 
Wy = eldllo0/2 


making all inner products of non-scattering particles explicit. (Hint: Write 
both this and the earlier form of the amplitude completely explicitly, in terms 
of operators on Z|¢]W[y].) We can interpret WY as the free “vacuum wave 


functional” and ZW as the interacting vacuum wave functional. 


2. Graphs 


Before giving applications of these rules, we consider a few general properties. A 
convenient way to describe the terms in the expansion of the two exponentials in Z 
is pictorially, by “Feynman diagrams/graphs”. Each factor of S; is a “vertex” of the 
graph, represented by a dot in the diagram; each factor of the propagator 1/K is a 
“link” in the graph, represented by a line connecting the two dots representing the 
two factors of S; on which each 6/dy acts. (Both derivatives can also act on the same 
factor of S;, giving a loop.) So any term in the expansion of Z is represented by a 
diagram consisting of a bunch of dots (interaction vertices) connected by lines (prop- 
agators). When we want to “draw” the amplitude A itself, we also draw additional 
lines, each with one end attached to a vertex and one end unattached. These “external 
lines” represent the one-particle wave functions coming from Wj, and not propagators 
(“internal lines”). While in the diagram for Z each vertex can have y dependence, 
in the diagram for A there is none, and the number of lines (internal and external) 
coming from any vertex explicitly indicates the order in y of the corresponding term 


in Sy. 


The physical interpretation of these diagrams is simple: The lines represent the 
paths of the particles, where they act free, while the vertices represent their collisions, 


where they interact. These diagrams are generally evaluated in momentum space: We 
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then can associate a particular momentum with each line (propagator), and momen- 
tum is conserved at each vertex. An arrow is drawn on each line to indicate the 
direction of “flow” of the momentum. (Otherwise there is a sign ambiguity, since 
complex conjugation in position space changes the sign of the momentum.) Then the 
sum of all momenta flowing into (or all out of) a vertex vanishes. The momentum 
associated with a line is then interpreted as the momentum of that particle, with 
the arrow indicating the direction of flow of the proper time 7. (p changes sign with 
T.) When evaluating an S-matrix element, the fact that the external-line wave func- 
tions satisfy the free wave equation means the external momenta are on-(mass- )shell 
(p? + m? = 0); on the other hand, this is not true of the momenta on the internal 


lines, even though those particles are treated as free. 


Some graphs in the S-matrix are disconnected: They can be divided into separate 
parts, each with a subset of the particles that interact with each other but not with 
the other subsets. For convenience, we consider only the connected graphs: If we 


write 
ail=e™, A= 0] 2] (Welly 

then W is the generating functional for the connected S-matrix A,. To prove this 
relation between Z and W, we first note that it is just the combinatorics of the 
graphs, and has nothing to do with spacetime. Therefore, it is sufficient to consider 
the simple (unphysical) case where the action has no derivatives. Since the propagator 
is then local, connectedness is equivalent in this case to locality. We then observe 
that the lack of derivatives allows the functional integral to be factorized explicitly 


into ordinary integrals at each point in spacetime: 


Wig = ij Dé ei & LOW ee) — Ul / dar) eH6(2),0(2)) 


=|[2¢@) = ei fd We) _ --iWie 


Thus, this W is local, and therefore connected; this implies W is connected in the 
general case. The simplest kind of connected graph is a “tree” graph, which is a graph 


that has no closed paths; the rest are called “loop” graphs. 
“One-particle-irreducible” (1PI) graphs are defined to be those connected graphs 


that can’t be disconnected by severing a single propagator. It then follows that any 
connected graph can be represented as a generalized tree graph, whose “vertices” 
(including two-point vertices) are actually 1PI graphs. We then define the “effective 
action” I"|¢] to be the classical action plus all 1PI loop graphs. Note that the vertices 


of the original action are the 1PI tree graphs; thus I’ is also the classical kinetic term 
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plus all (tree and loop) 1PI graphs. Actually, since the 1PI tree graphs are —iS7, we 
define the classical part of I’ to be S, but the quantum part to be the quantum 1PI 
part of W. Of course, the effective action is nonlocal. However, the tree graphs that 
follow from this action are exactly all the connected graphs of the original action: 
This is clear for all but the 2-point vertices from the definition. For the propagator 
and its relation to the 2-point 1PI loop graphs, we simply compute the expression 
following from I’: Denoting the 2-point 1PI loop operator as A, the kinetic operator 
of I is K + A. The propagator following from I’ is then 

1 1 1 1 111 

KA ke koe RK ROR 

But this is exactly the result of the complete propagtor (including loop graphs) fol- 


lowing from the original action. 


This quantum modification of the propagator leads us to reanalyze our prescrip- 
tion for evaluating the S-matrix: For example, even in the simplest case, where this 
A is just a constant, the full propagator differs from the free propagator by a change 
in the mass. This means the mass of asymptotic states should also be changed, which 
invalidates part of our evaluation of the S-matrix in the previous subsection. Similar 
problems occur when A is proportional to K, which changes the normalization of 
asymptotic states. There are two ways to fix these problems: (1) We compensate by 
modifying the kinetic term in the classical action, replacing AK with K minus such 
local contributions from A. Treating these new terms as part of the interaction in 
our derivation of the S-matrix, so our normalization and mass in the free propagator 


are unchanged, these “interaction” terms cancel the unwanted terms in the quantum 
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propagator, so it then has the same residue and pole as the free one. This procedure 
is known as “renormalization” , and will be discussed further in chapter VI, primarily 
for the purpose of eliminating infinities. (2) Alternatively, we modify our derivation 
of the S-matrix to take the full propagator into account. The easiest way to see this 
change is to remember that by definition the S-matrix follows from treating the effec- 
tive action as a classical action (except for its nonlocality and nonhermiticity), but 
keeping only the “tree” graphs. Then clearly (a) the quadratic part I of I’ is used 
to define the asymptotic states, and (b) instead of eliminating all free propagators 
except those connecting factors of $7, we eliminate all full propagators (found from 
Io) except those connecting factors of I, the nonquadratic part of I’. In other words, 
we modify our earlier definition of the S-matrix by dropping all graphs that have any 
quantum correction to external lines. Thus, this procedure can also be applied in 
the case of renormalization; in fact, it should be applied in general, simply because 
it allows us to immediately ignore many graphs. It also allows us to avoid confusion 
resulting from attaching wave functions of the wrong mass to propagator corrections: 
E.g., in momentum space, we would have to interpret ambiguous factors such as 


6(K)A(1/K)..., where the factor of 6(/) comes from a plane-wave wave function. 


© © 
BOOEE > DOC 


This analysis of the quadratic part of I’ also leads us to examine the terms of 
lower order: constant and linear. The constant term is just a normalization, and 
should be dropped. (This is not true in the case of gravity, where a constant term in 
the Lagrangian is not gauge invariant by itself.) The linear term describes the decay 
of a particle into the vacuum: It implies we have the wrong vacuum. A linear term 
necessarily has no derivatives (otherwise it is a boundary term, which vanishes by our 
boundary conditions); it is part of the “effective potential” (a generalization of the 
potential energy, whose contribution to a classical mechanics Lagrangian contains no 
time derivatives; see subsection VIIB2). The existence of a linear term means that 
the minimum of the effective potential, i.e., the true vacuum, is not described by 
vanishing fields. To correct this situation we therefore apply the same procedure as 
for the classical action (chapter IV): (1) Shift the appropriate fields by constants, 
to put us at the minimum of the potential, and (2) use the new quadratic terms in 


the potential to determine the true masses of states defined by perturbation about 
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this new vacuum. Again eliminating any constant terms, the resulting I’ has only 


quadratic and higher-order terms. 
To summarize, the general procedure for calculating Feynman graphs is: 
(1) Calculate the effective action, i.e., the 1PI graphs. 
(2) Shift the scalars to put them at the minimum of the effective potential, dropping 


the resultant constant, to reveal the true masses of all particles. 


(3) Calculate the S-matrix from diagrams without external-line corrections, with ex- 
ternal wave functions whose normalization and masses are determined by the 


zeroes of the kinetic operators in the shifted I’. 


Another use of the effective action, besides organizing the calculation of the S- 
matrix, is for studying low-energy behavior: This means we apply an expansion in 
derivatives, as in first-quantized JWKB (see subsection VA2). Of most interest is the 
lowest order in the approximation, where all fields are effectively constant: This gives 
the effective potential. (In practice, “all fields” means just the scalars, since constant 
spinor fields are not generally useful, while higher spins are described by gauge fields, 
whose constant pieces can be set to vanish in an appropriate formulation: E.g., the 
constant piece of the metric tensor can be attributed to a scalar — see subsection 
IXA7.) However, the definition of “1PI” graphs is ambiguous, depending on how we 
define “particle”: For example, if we include auxiliary fields in the effective action (as 
in supersymmetry, but also for bound-state problems: see subsections VIIB3 and 6), 
the result at fixed order in any expansion parameter (fh, coupling, etc.) is different, 
since the auxiliaries get contributions at each order, so elminating them by their 
effective-action field equations mixes orders. (E.g., B? +hABf(A) — h?f?.) This is 
crucial when the composite fields defined by these auxiliaries, and thus the auxiliaries 
themselves, obtain vacuum values. Therefore, the effective action is most useful for 
these purposes when, for appropriate choice of fields and definition of h, a useful 
first-quantized semiclassical expansion can be found. Another important use of the 
effective action is that it is gauge invariant (even in the nonabelian case, when using 
the background-field gauge; see subsection VIB8): Sometimes simplifications due to 


gauge invariance are thus easier to see in the effective action than in the S-matrix. 
We now consider an interesting topological property of graphs: 


(1) For any graph, if we draw an extra propagator from a vertex to itself or another, 


that gives an extra “loop” (closed circuit) and no extra vertices. 


(2) Adding a 2-point vertex to the middle of a propagator or external line gives an 


extra propagator and no extra loops. 
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(3) Adding an external line to a vertex changes nothing else. 


Since any nontrivial (not a lone propagator) connected graph can be built up this 


way from a lone vertex, we find 
P-V=L-1 


for P propagators, V vertices, and L loops (and E external lines). The same result 
follows from counting momentum integrals: In momentum space there is an inter- 
nal momentum, and corresponding integral, for each propagator, and a momentum 
conservation condition, and corresponding 6 function, for each vertex. The only in- 
dependent momenta are the external ones (associated with each y in Z|y]) and one 
momentum vector for each loop. Thus, after integrating out all the delta functions, 
except for an overall momentum conservation 6 function for each connected graph, 
we are left with integrations over only the loop momenta. So, we are again led to the 


above result. 


Exercise VC2.1 
For the figure at the beginning of this subsection, check this identity for each 
of the 3 connected graphs. Apply the above construction to produce each of 


them from a single vertex. 


3. Semiclassical expansion 


We can define perturbations by inserting h’s in various ways, as discussed in sub- 
section IIJA3. The fh that defines classical mechanics yields an expansion in derivatives 
on “matter” fields (those that describe classical particles in the limit h — 0, as op- 
posed to the “wave” fields). This expansion is covariant as long as the h multiplies 
covariant derivatives. However, it can’t be applied to Yang-Mills fields, and it doesn’t 
correspond to a diagrammatic expansion. On the other hand, the h that defines clas- 
sical field theory is an expansion in the number of “loops”, and allows us to group 
graphs in gauge-invariant sets, since gauge transformations are not h-dependent: As 
in quantum mechanics, we can perform a JWKB expansion by appropriately inserting 
h: 


Zia = | Do ex { -7 (Sle) + 516+)} 


6106 
= exp nin fae xe} e Silel/h 


The order in h has a simple graphical interpretation. We see that there is a factor 


of h for each propagator and a factor of 1/f for each vertex. Thus, by the above 
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topological identity, for each connected graph the power in fh is one less than the 


number of loops. We therefore write 


Z[gl=e Wel, WHS hw, 


L=0 
where Wo generates the connected “tree” graphs, which have no loops. 


We know that the leading term in the JWKB expansion is associated with the 
classical theory. We can make this more explicit in the field theory case by finding the 
general classical (perturbative) solution to the field equations from the tree graphs. 
Graphically the solution is very simple: We replace one y on each tree graph with a 
propagator, and associate the end of the propagator with the position of the classical 
field (x). If we then act on this ®, which is a sum over all tree graphs, with K, 
it cancels the propagator, leaving a bunch of ’s (also sums over all tree graphs) 
connected at x, with the appropriate vertex factor. In other words, we find K® = 
—0S7[®|/6®, the classical field equations. 


To prove this, it’s convenient to again use functionals, to automatically keep track 
of all combinatorics. The quantum field equations can be derived from the general 
identity F 

Do — =) 
[pe stl 
since we only integrate functionals f that are assumed to fall off fast enough as 


@ — oo to kill all boundary terms. (This follows from the perturbative definition of 


the functional integral.) In particular, for any action S, 
O38 65 _.¢ 
j= Do th— —iS/h -_ JP uM is /h 
/ ot j a @ j a 
For our present purposes, we choose 


§ = Sold] + Silo + 9] 
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bb 
OWI] 
K ='0) 
where “(@),.” is the expectation value of the field in a background: 
Do hei5/h 
bp = Spee 
{ Do e-*/ 


(and, of course, [ D@ e-t5/h — @iW/h), 
We now examine the classical limit h — 0 of this result by noting that if we 
impose the free field equation on the background 
58 _ 5S[g +9] 
Og dg 
where S[¢] = So[¢] + S7[¢] is the usual action: In other words, the field equations 


following from S' are just the usual field equations for the complete field 


Kp=0 => 


g=o+y, y=limd 


since we chose our boundary conditions so ¢ — 0 as x — oo (including |t| — oo, 


whereas y — 0 only at spatial infinity). We then apply the stationary-phase approx- 


imation (or, after Wick rotation, the steepest-descent approximation) 


lim / Déo fide" = (Flele*/ "| 


5S'/5¢=0 


for f = @ and 1 to find 


Wilyl_ 9 « goy— 1 dwolel 
dy Kop 


? = lim(P) 9 = $lssi41/66=0 


Thus, @ is the solution to the classical field equations with boundary condition @ — y, 


K@+ 


and can be found directly from the classical (0-loop) part of W by replacing one field 


with a propagator. 


A similar result holds for Wo itself, by taking the classical limit as above for 


f{e] = 1: 
Wolv] = Sold] + Si[o + ¢] 


when evaluated at the result of varying the above with respect to either argument: 


6, sslb+el 


36 7 oe 
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5 6 sWolyl 
cee et 
jo 66 "he ? 


The former follows directly from the limiting procedure; the latter we have just proven 


equivalent by evaluating the limit for (¢),. Since by definition the effective action I 
is related to W in exactly the same way that S is related to Wo (the trees from I 
give the full W), we also have 


Wie] = Told] + Tilo + 9] 


6rlot+y] = OWly] ig 
oe Kyp=0 ¢ — Ko 


where K is the kinetic operator appearing in Ip, the quadratic part of I’. (Some care 


must be taken for the fact that the poles and residues of K in p? may differ from 


those of K, as discussed in the previous subsection.) 
In practice, if one wants to make use of the classical field equations perturbatively, 
one looks at tree graphs with a specific number of external lines: For example, in a 


scalar theory with ¢° interaction (assuming (®) = 0), 


oo n-1 
KO+38 =0, P=S°G, > KO, +4)° 4, G,-m =0 
n=1 m=1 


gives a recursion relation for the term @, that is nth order in y. 
Exercise VC3.1 


Consider the relativistic Schrodinger (Klein-Gordon) equation for a scalar 


wave function w in an external scalar potential ¢: 


(K + dy =0 


(If you find it less confusing, you can consider the nonrelativistic case K = 
p?/2m — E, where H = p?/2m-+¢.) Find the perturbative solution for the 
quantum mechanical (one-particle) S-matrix for w (see exercise VA4.1). Show 
that this agrees with the contribution to the field-theoretic S-matrix for the 
Lagrangian 


LW, 9) = V*(K + bh + Lo(¢) 


coming from tree graphs with an external w line, an external 7* line, and an 
arbitrary number of external ¢ lines. 


Exercise VC3.2 


Consider, instead of the background field y, a “current” source J that attaches 
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propagators to external lines. The current is effectively just a one-point in- 
teraction (it caps loose ends of propagators), so it can be introduced into the 
generating functional by the modification 


Sil0| > Sild] + f de Jo 


We now have 


e iW [J] /h = 


[ D0 exp{—F. (Sildl + Sila] + f av so) } 


Z| J] is thus the Fourier transform of e~‘5!#!/" with respect to the conjugate 
variables ¢ and J. 


a Derive the “Schwinger-Dyson equations” 


+ 109) ore at 


( 6S{é) 
\ 55a) 


p= 1h5/5T 


b Find the classical limit 


oWolJ] 


Wold] = Sid] + JO at o= 7 _55I8) 


a “Sd 


(ie., Wo[J] is the “Legendre transform” of S[@]). Find the corresponding 


relation for ['[¢] and W[J]. 


c Show that the free part of W[.J] is given by 


W fees = —$ / dx ae Rg! = i free = RK? 


Note that J can be replaced with y in W[.J] everywhere except the free term, 
since in all other terms J appears only in the combination (1/K)J. Show 


how this can be done in such a way as to reproduce the results above for the 


solution of the classical field equations in terms of W[y]. Warning: Before 
this substitution we use Ky + J = 0, but afterwards we apply Ky = 0; also 
beware of integration by parts, since y does not vanish at oo, so the naive 
substitution ¢ > ¢+ is not very helpful. (Historically, Z was introduced as 
a functional of J. However, the only two applications of Feynman diagrams, 
the S-matrix A and the effective action I’, both required that the external- 
line propagators resulting from the Feynman rules for Z[.J] be “amputated”. 
Therefore, we use background fields exclusively. The resulting derivations, 
generalities, and applications are at least as simple as and often a little simpler 
than the corresponding ones with current sources.) 
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There is a major flaw in this relation between classical field theory and tree 
graphs, the “Klein paradox”. The difference is that classical field theory uses fields 
everywhere, not wave functions. Thus the propagator must be real (or pure imaginary, 
depending on conventions) to preserve the reality (or complex conjugation) properties 
of the fields; usually one uses the retarded propagator. On the other hand, in quantum 
field theory negative-energy states must travel backward in time to preserve positivity 
of the true energy, so the complex Stiickelberg-Feynman propagator must be used. 
Furthermore, in classical field theory the external line factors are the fields, which 
contain both positive and negative energies, the same on each line. In quantum field 
theory, each external line carries a different one-particle wave function, positive energy 


in the asymptotic past or negative energy in the asymptotic future. 


4. Feynman rules 


It is usually most convenient to calculate Feynman diagrams in Wick-rotated 
(to eliminate i’s) momentum space (where massive propagators are simpler). The 


“Feynman rules” are then read off of the action as 


where in Z|¢] we simply replace each field ¢ with a single-particle wave function in 
all possible permutations, since for the case of an N-particle amplitude we usually 
write the wave function as the product of N single-particle wave functions (although 


more generally it is a linear combination of these): 


We Fourier transform as 


a(e)= fap e®*o(p), lp) = f de e202) 


(Of course, ¢(x) and ¢(p) are different functions, but the distinction should be clear 
from context.) In practice we choose the single-particle wave functions to be eigen- 


states of the momentum, so 


A 


Pi(x) = e'**h,(p;), Wilp) = 6(p — pi)di(pi) (p; +m? = 0) 
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where o is some simple factor (e.g., 1 for a scalar). Then the external line factor 


terms become 


fu wear = bse 


while for propagator terms 


a 1,9 1 6 
/ Ot 55a) K(—10) 80a) / a 255@) K@) 50D) 


and for vertex terms 


[ae 6162) dnl) = f dorm 61001) --dulPn)B(D1 + + Pe) 


where each of the @’s in the vertex may represent a field with derivatives; then we 
replace —i0 on ¢(x) with p on ¢(p). Thus, e.g., we have 
» } 
An = Wni(Di a Z\o 
™ II = 5Bpo 0] 
Exercise VC4.1 
Use the definition 


to show that 


5) = ae 


where we now use tildes to indicate Fourier transformation. 


Note that there is some ambiguity in the normalization of external line factors, 


associated with the numerator factor in the propagator 


Aes 
KK 3(p? + m?) 
For nonzero spin (or internal symmetry), we have already discussed the normalization 


analogous to that for scalars, namely 


Sb (p)b(p) = £N(p) 


(with — for negative energy and half-integer spin). However, there is already some 
freedom with respect to coupling constants: Even for scalars, if a coupling appears in 
the kinetic term as a factor of 1/g?, then effectively the kinetic operator is K = Ko/g’, 
where Ko is the usual (coupling-independent) one. Thus N = g?No, so w = guo, 
meaning coupling dependence in external lines. Alternatively, this external-line fac- 


tor of the coupling can be included in the definition of probabilities in terms of 
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amplitudes, which already includes nontrivial factors because of the use of (non- 
normalizable) plane waves. Furthermore, quantum effects modify the form of the 
propagator: Such effects can be absorbed near the pole p? = —m? by a field redef- 
inition, but often it is more convenient to leave them. Then N will again have a 
constant, (but more complicated) coupling-dependent factor, which must be canceled 
in either the external-line factors or probabilities. (However, note that these questions 


do not arise in calculations of the functionals W|¢| or I’[¢].) 


In tree graphs all momentum integrals are trivial, with the momentum conserva- 
tion 6 functions at each vertex, and the 6 functions of the external lines, determining 
internal momenta in terms of external momenta. In loop graphs there is a momentum 
integral left for each loop, over the momentum of that loop. The amplitude will al- 
ways have an overall 6 function for momentum conservation for each connected piece 
of the graph. Since we are always interested in just the connected graphs, we pull 


“en 
4 


this conservation factor off to define the “T-matrix”: Including the factor of from 


Wick rotating back to Minkowski space, 


Sconnectea = 15 (Sp) T 


In general there will be combinatoric factors associated with a graph. These 
follow automatically from the functional expressions, but can also be seen from the 
symmetries of the graph. Here “symmetries” means ways in which the graph can 
be twisted, with external lines fixed, such that the graph looks the same, including 
the types of particles propagating along the lines. For example, a graph with 2 
vertices that are connected by n identical propagators would get a factor of 1/n! for 
that symmetry. There are also sign factors from fermions: Permutation of external 
fermion lines gives minus signs, because it involves permutation of anticommuting 
fields in the functionals. Each fermion loop gets a minus sign for the same reason. 
(This is related to the fact that fermionic integration gives determinants instead of 
inverse determinants.) Explicitly, it comes from evaluating expressions of the form 


(ignoring momentum dependence and external fields) 
6.6 6.6 = <i 
(23) e (=3) (dw) --- (dp) 
a a 6 O 6 -_ 
- (aaa) *[ aor) Gao?)| ~ [Caer Ga?) 


where the propagator derivatives (5/dw) (6/6) give no signs connecting up successive 


vertex factors ~w, but the last one does in closing the loop. 
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The general rules for contributions to the (unrenormalized) effective action I"[d] 


are then: 
(A1) 1PI graphs only, plus So ([ 3¢K¢). 


(A2) Momenta: label consistently with conservation, with | dp for each loop. 

(A3) Propagators: 1/5(p? + m?), or 1/K, for each internal line. 

(A4) Vertices: read off of —Sr. 

(A5) External lines: attach the appropriate (off-shell) fields and [ dp, with 6(>° p). 
(A6) 


A6) Statistics: 1/n! for n-fold symmetry of internal/external lines; 


—1 for fermionic loop; overall —1. 
(If we want to calculate W[¢] instead, then simply replace step 1 with “Connected 
graphs only”.) The next step is to analyze the vacuum: 


(B1) Find the minimum of the effective potential (for scalars). 


(B2) Shift (scalar) fields to perturb about minimum; drop constant in potential. 


(B3) Find resulting masses; find wave function normalizations. 


Renormalization is performed either before or after this step, depending on the 

scheme. Finally, the trees from J’ are identified with the complete amplitudes from 

S. Thus, T-matrix elements are given by: 

(C1) Connected “trees” of (shifted, renormalized) I’: (A2-4) for L=0 with S - I. 
C2) “Amputate” external [-propagators. 


(C2) 
(C3) External lines: 1, or appropriate to Iy wave equation Kw = 0 (Sovitd = +N). 
(C4) 


C4) External-line statistics: No symmetry factors; —1 for fermion permutation. 


Note that I” is usually simpler than T with respect to treatment of external 
lines: In 7’ we often have contributions from graphs which are identical except for 
permutation of external lines from identical fields. In I” only one such graph need 
be considered, since the statistics of the attached external fields automatically takes 


care of this symmetry. (We then also drop the 1/n!, or at least reduce it.) 


We label the lines of a graph with arrows to indicate the direction of “flow” 
of momenta: Momentum conservation means the total momentum flowing into any 
vertex is equal to that flowing out, which we use to eliminate dependent momenta 
(integrating out 6 functions). (In tree graphs all internal momenta are determined by 
external ones, which are constrained by conservation of total external momentum. ) 
This “sign” of direction of the arrow is independent of the sign of the energy; we must 
combine the two to determine whether an external state is initial or final: An incoming 
external line with positive energy is initial, negative energy is final; an outgoing line 
is the opposite (i.e., for positive energy the arrow indicates the direction of time, but 


negative energy means travel backward in time). The choice of direction of arrows 
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is arbitrary, and the convenience of any choice depends on the graph and the theory. 
There is no correspondence between this choice and signs of energy, since generally 
one wants to apply the same graph for cases with each external line with either 
sign, whereas internal lines in trees may have either sign depending on the external 
kinematics, and those in loops must be summed over both signs. Often the direction 
of the arrow is chosen to indicate the direction of flow of positive charge, when such 


a quantum number (U(1) symmetry) exists. 


The simplest nontrivial tree graphs are 4-point amplitudes. We now label all 
momenta as incoming, which is convenient for symmetry, and corresponds naturally 
to using incoming (initial) states with positive energy and outgoing (final) states 
with negative energy (as from the complex-conjugate final wave functions). These 
momenta are conveniently expressed in terms of the Mandelstam variables (see sub- 


section IA4): with these signs, 


s = —(p1 + po)’, t = —(pi + ps)’, u=—(pi + pa)” 


We also use the convention that s is defined in terms of the momenta of the two 
initial particles (and we also use this same definition when there are more than two 
final particles); t and u are then more or less interchangable, but if initial and final 


particles are pairwise related we choose ¢ in terms of the momenta of such a pair. 


1 o 1 3 
+ ty 1+3 + 144 Vu 
2 4 2 4 


For example, the simplest nontrivial theory is ¢° theory, with 
K =}(-O +m?) 3 Hp? +m’) 


S7= pu 590° = ie dp dps 59O(P1) O(p2) o(p3)d (Sop) 


The four-point S-matrix amplitude at tree level (order g?) then comes from using the 


following contributions to the factors in A (calculated in Euclidean space): 
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J (6/50)(1/K)(6/56)/2 _ / dk 1—° ! 3 
e€ ieee 2 5d(k) E(k? + m2) OOK) 
er = +5597 +... 
Using (6/6¢(p))é(k) = 6(p—k), keeping only the connected part, and integrating out 


the 6 functions (except 6(>> Decternal)), we are left with 


1 1 1 
S=0 2 (| ——__ +. ——__ + —____ 
(pi + po + p3 + pa)g (necytiecy tT) 


(There would be an extra iz for the 6 in Minkowski space.) Note the symmetry factor 
1/3! for the ¢* coupling, which is canceled upon taking 3 functional derivatives for 


the vertex factor. The T-matrix then comes from just factoring out the 6: 


But this contribution to W is given by a single term: 


1 
Wie ee / dpidpodps 2l20(p1)(P2) goa — 5 laelps) (Ps — pz — ps)| 
or, in position space, 
__ 2 1/142)___! 142 


(The 4's correspond to the various symmetries: switching a pair connecting to the 


same vertex, or switching the two pairs.) 


Exercise VC4.2 
Find the 5-point tree amplitude for ¢° theory. What order in g is the n-point 


tree? 


The momentum integrals are real in Euclidean space: There are no singularities 
in the integrand, since p* + m? is always positive (although there are some subtleties 
in the massless case). Thus, all these integrals are most conveniently performed in 
Euclidean space. However, eventually the result must be analytically continued back 


° _, ix®, which means p® — ip® (but po — —ipo, being also 


to Minkowski space: x 
careful to distinguish 6”, — 6” and dmn — "mn for indices on fields) via a 90° rotation. 
This returns some of the 7 dependence. There are also 7’s associated with integration 
measures: Since all the momentum integrals for the S-matrix elements have already 
been performed, all that remains is a factor of i to go with (> p) for each connected 


graph. There can also be 7 dependence in external line factors. 
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Remember that negative p° indicates a particle traveling backward in time; the 
true motion of such a particle is opposite to that of the arrow indicating momentum 
flow. Thus, external lines with arrows pointing into the diagram and positive p°, or 
out of the diagram and negative p’, both indicate initial states, arriving from t = —oo. 
Conversely, external lines with arrows pointing into the diagram and negative p®, or 


out of the diagram and positive p®, both indicate final states, departing to t = +o0. 


A related issue is particles vs. antiparticles. If a particle is described by a real 
field, it is identified as its own antiparticle; but if it is described by a complex field, 
then it is identified as a particle if it has a certain charge, and as an antiparticle if 
it has the opposite charge. (For example, a proton is positively charged while an 
antiproton is negatively.) Of course, this is convention, since a complex field can 
always be replaced by two real fields, and we can always relabel which is the field 
and which the complex conjugate; generally there should be a useful conservation law 
(symmetry) associated with these complex combinations (usually electric charge), 
and the one called “particle” is the one more common to the observer. For example, 
suppose we have a complex scalar external field/wave function ¢(p). For p® > 0 this 
describes a particle propagating to x° = +o0. Similarly, 6*(p) for p® > 0 describes 
an antiparticle propagating to 7° = +oo. On the other hand, ¢* for p® < 0 describes 


a particle propagating from 2° = —oo, while ¢ for p® < 0 describes an antiparticle 


propagating from 2° = —oo. 


5. Semiclassical unitarity 


As in nonrelativistic quantum mechanics, the only conditions for unitarity are 
that: (1) the metric (inner product) on the Hilbert space is positive definite (so all 
probabilities are nonnegative), and (2) the Hamiltonian is hermitian (so probabilities 
are conserved). Both of these conditions are statements about the classical action. 
The second is simply that the action is hermitian, which is easy to check. The first is 
that the kinetic (quadratic) terms in the action, which define the (free) propagators, 
have the right sign. This can be more subtle, since there are gauge and auxiliary 
degrees of freedom. Therefore, the simplest way to check is by using the lightcone 


formalism. 


We see from the analysis of subsection IIB3 for field equations, or for actions in 
subsection HIC2 (for spins < 1, and more generally below in chapter XII) that after 
lightcone gauge fixing and elimination of auxiliary degrees of freedom the kinetic 
terms for physical theories always reduce to —}¢D¢ for bosons and $u(1/i0*)w for 


fermions, where there is a sum over all bosons and fermions, and each term in the 
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sum has a field with a single hermitian component. Complex fields can multiply their 
complex conjugates, but these can always be separated into real and imaginary parts. 
There are never crossterms like ALB, since after field redefinition, i.e., diagonalization 
of the kinetic operator, this gives A’/DOA’ — B’OB’, so one term has the wrong sign. 
Similar remarks apply to massive fields, but with 0 replaced by D—m? (as seen, e.g., 


by dimensional reduction), or we can treat the mass term as part of the interactions. 


Exercise VC5.1 
What’s wrong with ADA + BOB +m?AB? (Hint: something, but none of 


the above. Diagonalize.) 


Now we only need to check that the single-component propagators of these two 
cases define positive-definite inner products. Since multiparticle inner products are 
products of uniparticle inner products, it’s sufficient to look at one-particle states. 
We therefore examine the S-matrix defined in subsection VC1 for the special case of 
1 particle at t = —oo going to 1 particle at t = +-oo, using the free, massless lightcone 
Lagrangians given above. For the boson we found that this matrix element was simply 
the inner product between the two states, appearing in the form f[ do do’ Avy’. This 
worked only because the propagator had the right sign. (It is essentially +e~“!4.) 


Thus the sign we use is required for unitarity. 


A simple way to treat the fermion is to use supersymmetry: Since the boson and 
fermion kinetic terms are spin independent in the lightcone formalism, we can look at 
any supersymmetric theory, and check the boson and fermion kinetic terms there. If 
the boson term agrees with the one we just checked, then the fermion term is OK by 
supersymmetry (which preserves unitarity by the lightcone-like supertwistor analysis 
of subsection IIC5). Alternatively, we can use the same method applied to the boson: 
The propagator now has an extra factor of 1/(—i0*), or 1/pt in momentum space. 
Since p* is always positive for positive energy, these states also appear with the 
correct-sign norm. To analyze negative energy (antiparticles), we note that in the 
derivation of the path integral final states always appear to the left and initial states 
to the right. In the fermionic case this is important because it will introduce an extra 
sign: Since 


V1 ave = Haart 


(with two signs canceling from reordering fermions and integration by parts), the 
right sign will always be produced with correct ordering of initial vs. final states 
(i.e., positive vs. negative energy), independent of the helicity (or whether electron 


vs. positron, etc.) 
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For similar reasons, it is clear that integral spin is always described by commut- 
ing (bosonic) fields, while half-integral spin is always described by anticommuting 
(fermionic) fields: The number of undotted minus dotted indices on a field is always 
odd for half-integer spin, even for integer, and a derivative carries one dotted and one 
undotted index, so contraction of all indices means an even number of derivatives for 
integer spin and odd for half-integer. Without loss of generality, we then can separate 
each field into its real and imaginary parts. Then for each real field integration by 


parts gives 
o(—i0)"@ = (—1)"[(—70)" 9] = (-1)"*%o(—i0)"6 => (-1)"*F = 1 


where (—1)? is the statistical factor for ¢ (1 for bosons, —1 for fermions), and we 
have included the appropriate 2’s for hermiticity of the action. Thus, integer spin is 
associated with bosons ((—1)" = 1 = (—1)?), and half-integer with fermions ((—1)" = 
—1 = (-1)?). This is the “spin-statistics theorem”. By using real fields with a 
diagonal kinetic term, we have implicitly assumed kinetic terms appear only with the 


correct sign: For example, for a complex bosonic field 
¢-A+iB = ¢$10¢6—AidB=—BiOA 


and thus has indefinite sign. Thus, spin and statistics follows from Poincaré invari- 
ance, locality, and unitarity. If we drop unitarity, we get “ghosts”: We’ll see examples 


of such wrong-statistics fields when quantizing gauge theory. 


Note that demanding unitarity (the right sign of the kinetic term) is the same as 
demanding positivity of the true energy, as least as far as the kinetic term is concerned: 
The energy is given by the Hamiltonian of the field theory; if the kinetic term changes 
sign, the corresponding contribution to the Hamiltonian does also. (Compare the 


discussion of Wick rotation of the action in subsection VB4.) 


Using anticommuting fields to describe fermions is more than a formality. In gen- 
eral, the significance of describing states by quantizing classical fields that commute 
or anticommute has two purposes: (1) to avoid multiple counting for indistinguish- 
able particles, and (2) to insure that two identical fermions do not occupy the same 
state. Thus, when describing two particles in different states, the phase associated 
with (anti)commutation is irrelevant: A “Klein transformation” can be made that 
makes anticommuting quantities commute for different states, and anticommute (i.e., 
square to zero) only for the same state. However, such transformations are nonlocal, 


and locality is crucial in relativistic field theory. (See exercise 1A2.3e.) 
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6. Cutting rules 


For some purposes it is useful to translate the three defining properties of relativis- 
tic quantum field theory into graphical language. Poincaré invariance is trivial, since 
the propagators and vertices are manifestly Poincaré covariant in covariant gauges. 
Unitarity and causality can also be written in a simple way in functional notation. We 
first note that the inner product for free multiparticle wave functions can be written 


very simply in momentum space as 


(lx) = (W'idle?*x[))|g9. = D+ = / dp aa Asm 
Alp) = _ tN (p) = A,(p) = 0(p°)276[4(p? + m?)]|N(p) 


3(p? + m? — ie) 
where w and x are products of positive-energy single-particle states, and A, (the 
“cut” propagator) projects onto the positive-energy mass shell. (The exponential 
takes care of the usual combinatoric factors.) We have written a generic propagator, 
with numerator factor N (=1 for scalars). (Without loss of generality, we have as- 
sumed a real basis for the fields, so N can be taken as real.) The S-matrix amplitude 


then can be written in operator language as 


¥[d] = v'[dlxid] > A= [ De wel = 4) 7] 216] = wish) 


} 
56 
We have used the fact that positive-energy states propagate forward in time and 
negative backwards to write the usual Hilbert-space inner product in terms of initial 
and final states of positive energy. The S-matrix operator S appears because y and 
w satisfy the free equations of motion, and S performs time translation from t = —oo 


for y to t = +00 for w to include interactions. 


The unitarity condition is then (see subsection VA4) 
SIS=1 => Z[dle?+Z[d] =1 


while causality is 


oo +O a oo ee 2 
say (Sl say) oe samy (20 aC) 


This causality relation, which already holds in nonrelativistic field theory, can be 


z(¢\) =0 forz®>y° 


strengthened by using Lorentz invariance: If (x —y)? > 0 (spacelike separation), then 
x° < y® can be Lorentz transformed to 2° > y°. Thus, the above expression vanishes 


everywhere except on or inside the backward lightcone with respect to x — y. These 
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functional forms of unitarity and causality (and Poincaré invariance) can also be used 
as a basis for the derivation of the functional integral form of Z[] in terms of the 
action, rather than relying on its relation to the Hamiltonian formalism. 

The fact that these conditions are satisfied by Feynman diagrams follows easily 
from inspection. We examine them using the explicit expression for Z following from 


the functional integral: 


Zlgj=ePe**, = Z[g]t = eel 
4 J oe ees ee 
Df GAM aay 40) ~ TERE 
D3 far 4g 40 = TET 


These expressions can be translated straightforwardly into position space as 
6 6 é } 
dp ——~A syn fee —— A(x — 7’) ———~ 
ar omaes 5aay See 


From the results at the end of subsection VB2, we see that the propagators satisfy 


etc. 


the relations 
A, (xz) = A(z) — Ag(x) = A*(z) + Ap(z) 
A,(«)=A(-2), Ale)=A(-2), Ar(x) = Aa(-2) 


9 < 0. We will now see that the cancellations in 


and, of course, Ar(x) = 0 for x 
the unitarity and causality relations occur graph by graph: There are contributions 
consisting of a sum of terms represented by exactly the same diagram, with each 
term differing only by whether each vertex comes from Z or Z'. First, this affects 
the sign of the term, since each vertex from Zt gets an extra sign from the e’! in Zt, 


as compared to the e~*! in Z. Second, this affects which propagators appear: 
(t,y) in: (2,2) > A(x —y) 

(Z',Z) 

(Z, zh) A_(x—y) 

(Z', Z") + A*(z—y) 


+ Ae =y) 
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Now if we sum over two terms differing only by whether the position x of one 


particular vertex appears in Z or Z', the result before integration is proportional to 


[1 4@-») []4-@- 2) -[]4.@-w [] 4*@- 4) 
where y; are from Z and z; are from Z'. However, 
A(z —y)-— Ay(@—y) =A_(x—y) - A*(@—y) =Arnly—2)=0 for x >y! 


Writing A = A, + Ar and A_ = A* + Ap in the difference of the two products, 
each surviving term in the difference contains a Ap, and therefore the two products 
cancel if x° is the latest of all the vertices. We thus take any sum of the same 
diagram over different distributions of the vertices between Z and Z' occuring in the 
unitarity or causality relation before integration over the coordinates of the vertices, 
separate the sum into pairs which are identical except for whether the latest vertex is 
in Z or Z', and apply the above relation to show this difference vanishes. Thus, the 
vanishing of a sum of graphs indicated by unitarity or causality is actually satisfied by 
cancellation between each pair of terms before integration over coordinates. (Which 
pair is determined by the values of the coordinates, since we need to find the latest 
one; after integration, the cancellation is between the whole set of terms for the same 
graph.) In the unitarity relation we sum over whether a vertex occurs in Z or Z" for 
each vertex, including the latest one, so that condition is easily satisfied. (The only 
diagram that survives is the one with no particles, which gives 1.) In the causality 
relation we perform this sum for each vertex except y (since 6/d¢(y) acts only on Z), 


but since y° < x°, y° is not the latest vertex, so again the latest one is summed over. 


-Of--<q]- O) - = 0 


Exercise VC6.1 
Consider an arbitrary 1-loop graph. Why would replacing all the propagators 
A with advanced propagators A, (or all with retarded propagators Az) all the 
way around the loop in the same direction give zero? Use this result, and the 
relation between the various propagators, to show that any one-loop diagram 
(with normal propagators) can be expressed as a sum of products of tree 
graphs, with some summations of external states (“Feynman tree theorem” ). 
How does this differ from the cutting rule for unitarity? (Hint: Look at the 


signs of the energies of external states.) 
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There is one fine point in this construction: We may use Feynman rules from a 
complex action, such as those used for massive theories in subsection HIC4, or when 
using complex gauges (see section VIB). In that case, since the S-matrix S is gauge- 
independent, and the original action S was real (before eliminating complex fields 
or choosing complex gauges), it is legal to use the Hermitian conjugate action S‘ to 
define the Feynman rules for St (and Z'): When multiplying S'S, we use the usual 
rules to find the second factor S, and the conjugate rules to find the S used in the 
first factor S': 

S'S = [S(S")]'S(S) 


The result of conjugating the S-matrix then will be to complex conjugate twice, and 
return rules identical to those used for S, except for the differences noted above for 
real actions. That means that the above proof of the cutting rules goes through 
unmodified, where we use the same complex rules in the entire diagram, regardless of 
whether they are associated with Z or Z'. In particular, this means that vertices from 
the two parts of the graph will differ only by sign (conjugating just the 7 in e’°’, not the 
S;), and propagators will differ only by their (momentum-space) denominators, not 
their numerators. This is particularly important for the complex fields of subsection 


IIIC4, since otherwise even the types of indices carried by the fields would differ. 


7. Cross sections 


In quantum physics, the only measurables are probabilities, the squares of abso- 
lute values of amplitudes. Since we calculate amplitudes in momentum space, proba- 
bilities are expressed in terms of scattering of plane waves. They are more naturally 
normalized as probabilities per unit 4-volume (or D-volume in arbitrary dimension), 
since plane waves are uniformly distributed throughout space. This can be seen ex- 
plcitly from the amplitudes: Because of the total momentum conservation 6-function 


that appears with each connected S-matrix element S;;, we have for the probability 


P 
Spi = 10 (dp) (ue 
= P= (Spl =p (Sop) 80) = [Fpl (Op) ae 


where we have found the coordinate D-volume by the Fourier-transform definition of 


s(0) = fari= foo i= oe 


the 6-function: 
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A “cross section” is defined as a probability for the scattering of two incoming 
particles into some number of outgoing particles. The scattering is “elastic” if the 
two final particles are the same as the two initial particles (they exchange only 4- 
momentum), “inelastic” otherwise. Generally one particle is in a beam directed at a 
target (at rest in the lab frame) containing the other “incoming” particle, but in some 
experiments two beams are directed at each other. In either case the cross section is 
defined by the rate at which one particle interacts divided by the flux of the other 


particle, where 


rate of arrival 


flute = = (density) x (relative velocity) 


area 


and thus the “differential cross section” (yet to be integrated/summed over final 


states) is 


P 1 
do = — X 
Vp P1P2V12 


The “total cross section” o is then given by summation over all types of final states 


and integration over all their remaining momentum dependence. 


The spatial density p is the integrand of the spatial integral that defines the inner 


product: From subsection VB2, for bosonic plane waves (w,(x) = e’?”) we have 


‘ Le 1p 1, — lp 
_ * 

= = Gay oP 
where w = |p°|. (The same result can be obtained for fermions, when their external 


line factors are appropriately normalized.) 

The expression for do is actually independent of the frame, as long as the 3- 
momenta of the two particles are parallel. This is the case for the most frequently 
used reference frames, the center-of-mass frame and the “lab frame” for either particle 
(where that particle is at rest, as is the lab if that particle is part of a target). Then 


Pi Po | 


(wywev12)? = Wy Wy |—— — We = |wepi — wy po)? a —$(P1jaP26))” = Nis 


using again the Mandelstam variables and ,,; introduced in subsection [A4. Finally, 
we include the “phase space” for the final states to obtain 


— pp a l2m)?8? (Sap) pp _ dp 2 
do = |T;| Nes Ll aye Il Hi 


f:n ident 
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where the first product is over all final one-particle states, and the second is over 
each set of n identical final particles. The normalization again follows from the inner- 
product for plane waves: By Fourier transformation, d?~!x p > d?~!p/(2r)?~'p. 
It also appears in the “cut propagator” A, used in unitarity, as in the previous 


subsection: ppl 


f abe Ow 2rAIBe +m?) = f SP 
(29)? 2 (27) P/2-lyy 

The simplest and most important case is where two particles scatter to two par- 
ticles. (This includes elastic scattering.) The “differential cross section” da/dQ, 
where dQ is the angular integration element for p3, is found by integrating do over 
d?~1p, and d|p3|. The former integration is trivial, using the 6 function for (D — 1)- 
momentum conservation. The latter integration is almost as trivial, integrating the 


remaining 6 function for energy conservation: 
=i 
6(|P3| — |Pslo) 


ayo p® 


Ops 


a's 5 (Sp) = d 2 dlp (ps)? 


wr = W1 + W2 — 4/ (p3)? + m3 — 4/ (Pi + Be — D3)? + m4 
ODP? _ =3(s — m3 — mj )ws + m3wy 
A|ps| |P3|w3Wa 


where |p3|9 is |p3| evaluated as a function of the remaining variables at S~ p® = 0. We 


then find F |p: 
h P3 

ren ee  ——— 
in ( T) | fil A12[5(s — m3 — m3)ws — mew] 


[es 


The center-of-mass frame (see subsection IA4) is the simplest for computations. 


In that frame the differential cross section simplifies to 


do _ 4 & ae 
ins (27)°|T¥;| kee 
and in particular in D=4: 4 
o 34 
a Say ta 
fo 


Another convenient form for the differential cross section is do/dt, trading 6 for t and 
integrating out the trivial dependence [dd = 27 (for D=4). In the center-of-mass 
frame we have 


df? = 2md(cos @) = dt 


8 
TT —_— 
Ay2A34 
Since t is Lorentz invariant, we therefore have in all frames (that conform to our 


earlier requirement for the Ag factor in do) 
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For example, for the 4-point scalar example considered in subsection VC4, we 


have 


do _— (4m)8g4 ( 1 1 1 ) 


dt s(s —4m?) \s—m? | t—m? © u—m? 


Exercise VC7.1 


A simpler example than the cross section is the decay rate, 


for initial density p (where t is now time). 


For the case of decay of a particle of mass M into 2 particles of masses m1, M2 


in D=4, show that in the rest frame 


dP A12 ‘ 
— = 4r——|T}; 

ae AT lta 

(with a factor of 5 for final identical particles). What happens when M = 
M+ Mag? 

For the case of coupling S; = f dx g¢i¢2¢3, evaluate the classical (tree) 
contribution in terms of g,M,m ,,m2. Give the dimensional analysis of the 


result. Consider also the case where the final particles are massless. 


Exercise VC7.2 


Consider the cross section for elastic scattering of two particles to lowest order 
(tree graphs), in four dimensions. For the following, consider the nonrelativis- 


tic limit (small velocities). 


Show from the definition that classically the “total cross section”, as indi- 
cated by the name, is just the cross-sectional area with respect to the beam 


(assuming each “arrival” results in an “interaction” ). 


A ¢* interaction correpsonds to a 6-function potential in classical mechanics, 
since it has zero range. In classical mechanics, such “billiard ball” scattering 
is purely geometrical, depending only on the “size” of the balls. Find the 
effective “radius” of these classical billiard balls in terms of their mass and 
coupling. 

Replace the $4 interaction with a ¢?x interaction, where y is an intermediate 
particle with a different mass (nonzero, and also much larger than the @ 
kinetic energies). (We still consider scattering of ¢ particles.) What is the 


effective radius of the @ particles? 
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d In the limit where the x mass becomes infinite, but also its coupling, the cubic 
interaction is effectively replaced with a $4 interaction (finite when the limit 
is taken appropriately). What is this coupling constant in terms of the cubic 


coupling and y mass? 


For some purposes (such as considerations of unitarity and causality, as in the 
previous subsection) it is useful to draw the Feynman diagrams for the cross section 
itself (or actually |T|?). In such a diagram we draw one of the diagrams from S 
and one from St, separating the two by a line (dashed, zig-zag, or shaded on one 
side, according to your preference), and connecting all the external lines (initial and 
final) on one side to the corresponding ones on the other. The result is a bubble 
diagram with a “cut”: The “cut propagators” are A, = 276(p°)N6[3(p? + m?)| 
(or A_, depending on how we label the momenta), corresponding to the propagator 
A = N/3(p? + m?), if we sum over all polarizations; otherwise N is replaced by a 
term in the sum N = So pty. The momenta of the cut propagators may not be 
integrated over, depending on whether they represent final states whose momenta 
are summed over (i.e., not measured; in practice the momenta of initial particles 
are always measured). The only other difference in the Feynman rules from the S- 
matrix is that in Wick rotating back S gets the usual m? — m? — ie while S' gets 
m? — m? + ie, and each connected graph in S gets an id(S*p) while each in S' 
gets a —id()>p). The algebra for the cross section is thus identical to that of a 
vacuum bubble (although the momentum integration is not, and the cut propagators 
lack the usual denominators). In particular, instead of summing over just physical 
polarizations in a cut vector propagator, which corresponds to using a unitary gauge, 
we can include ghosts in the external states, and use any gauge: This follows from 


the cutting rules derived for unitarity. 
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8. Singularities 


We know from free theories that any propagator has a pole at the classical value of 
the square of the mass. This statement can be extended to the interacting theory: The 
(“Landau”) singularities in any Feynman diagram are exactly at classically allowed 


(on shell) values of the momenta. 


The simplest way to see this is to write the propagators in a way reminiscent of 
the classical theory, where the appearance of the worldline metric in the action results 


in the (Wick-unrotated) Schwinger parametrization of the propagator, 
oe ee i ir exit (ptm?) /2 
gp? +m?) Jo 
For simplicity we consider a scalar field theory with nonderivative self-interactions. 


The corresponding form of a Feynman diagram, written in momentum space by 


Fourier transformation, is then 
/ dx'.dp;;dT;; a7 ar, [rig Bi, t9n?)/2—(@s—25) Pag 
geDag Tig 


where 7, 7 label vertices (and external endpoints), (ij) labels links (propagators), and 


dx’ indicates integration over just vertices not attached to external lines. 


Integration over these x’’s produces 6 functions for momentum conservation: 
5 pij = O 
J 


i.e., the sum of all momenta flowing into any vertex vanishes. (Note p;; = —p,;.) This 
constraint can be solved by replacing momenta associated with each propagator with 
momenta associated with each loop (and keeping momenta associated with external 


lines). For example, for planar diagrams, we can write 
Pij = PIT = kr —ky 


where IJ, J label loops: p;,; labels the propagator by the two loops on either side, 
rather than the vertices at the ends (and similarly for 7; = 7,7). Similar remarks 
apply to nonplanar diagrams, but the parametrization in terms of loop momenta k; is 
more complicated because of the way external momenta appear. (In the planar case, 
the above parametrization can also be used for external momenta, and automatically 


enforces overall momentum conservation.) The Feynman integral then becomes 


[Kans ear, tra (pra(k)?2+m?]/2 
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where k are loop (k’) and external momenta after solving the conservation conditions. 


We can now treat the exponent of the Feynman diagram in the same way as a 
classical mechanics action, and find the corresponding classical equations of motion by 
varying it. By the stationary phase approximation (or steepest descent, after Wick 
rotation of 7), the classical solutions give the most important contribution to the 
integrals (at least for weak coupling). This approximation is related to long-distance 
(i.e., infrared) behavior (see exercise VB4.3a). Taking account of the fact that the 7’s 
are constrained to be positive (by treating 7 = 0 separately or making a temporary 


change of variables 7 = 3? or e® to an unconstrained variable), we find 
Try(p7y +m) = 0, » TrPrs = 9 
Z 


or, in terms of the original variables, 


Tii(Pig tm) =0, pig = a nt! 
7 
These are known as the “Landau equations”. Their correspondence with classical 
configurations of particles follows from treating 7 as the proper time, as seen from 
p = Ax/Ar. (Actually, it is the generalization discussed in section IIIB of proper time 
to include the massless case; in the massive case s = mr.) The equation T(p?-++m?) = 0 
says that either the particle for the line is on-shell or there is no such line (it has 
vanishing proper length), while the equation }°>,7p = 0 says that the sum of tp 


around a loop vanishes, another statement that pArt = Az. 


Exercise VC8.1 
Consider the “one-loop propagator correction”: the graph where a single par- 
ticle of mass M splits into two of masses m; and m2, which rejoin. What is 
the physical condition relating these 3 masses (with all particles on shell to 
satisfy the above)? What happens when M = m, + m2? What if m2 = 0? 


Note that these physical singularities are all in physical Minkowski space: In 
Euclidean space, p?-+m? is positive definite, so it never vanishes (except for constant, 
massless fields p = m = 0). Furthermore, in Euclidean space, one can always rotate 
any momentum to any direction, whereas in Minkowski space one can never Lorentz 
transform to or through either the forward or backward lightcone. Thus, calculating 
in Euclidean space makes it clear that S-matrix elements for positive-energy states 
are given by the same expressions as those for negative-energy states: To compare 
two amplitudes that are the same except for some final particles being replaced with 


initial antiparticles, or vice versa, we just change the sign of the energy. This is 
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called “crossing symmetry”. In the case where all particles are reversed, it is CPT 


invariance (“CPT theorem” ). 


9. Group theory 


Although the manipulation of spin indices in Feynman diagrams is closely tied to 
momentum dependence, the group theoretic structure is completely independent, and 
can be handled separately. Therefore, it is sufficient to consider the simple example 
of scalars with a global symmetry. (The more physical case of chromodynamics will 
be the same with respect to group theory, but will differ in dependence on momentum 
and spin.) The simplest case is the U(N) family of groups. We choose an action of 
the form 

L = 4tr[2(06)? + V(9)] 


where ¢ is a hermitian NxN matrix. This action is invariant under the global U(N) 
symmetry 


¢ = UU" 


A simplification we have chosen is that the interaction has only a single trace; this 
is the case analogous to pure Yang-Mills theory. (We also included the coupling 


constant as in Yang-Mills.) 


gluon quark 


When we draw a Feynman diagram for this field theory, instead of a single line for 
each propagator, we draw a double (parallel) line, each line corresponding to one of 
the two indices on the matrix field. Because of the trace in the vertex, the propagator 
lines connect up there in such a way that effectively we have continuous lines that 
travel on through the vertices, although the two lines paired in a propagator go their 
separate ways at the vertex. These lines never split or join, and begin or end only on 


external fields. We can also draw arrows on the lines, pointing in the same direction 
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everywhere along a single line, but pointing in opposite directions on the two lines 
in any propagator pair: This keeps track of the fact that @ appears in the trace 
always multipled as ¢¢ and never as ¢¢7. A physical picture we can associate with 
this is to think of the scalar as a bound-state of a quark-antiquark pair, with one 
line associated with the quark and another with the antiquark; the arrows are then 
oriented in the direction of time of the quark (which is the opposite of the direction 
of time for the antiquark). The quark is thus in the defining representation of U(N) 
(and the antiquark in the complex conjugate representation). Group theory factors 
are trivial to follow in these diagrams: The same color quark continues along the 
extent of a “quark line”; thus, there is a Kronecker 6 for the two indices appearing 


at the ends of the quark line (at external fields); each quark is conserved. 


Exercise VC9.1 
Using the quark-line notation, where the lines now represent flavor, draw all 4- 
point tree graphs, with 3-point vertices, representing scattering of Kt K~ — 
ma (see subsection IC4) via exchange of other (pseudo)scalar mesons in 
that U(6) multiplet. What are the intermediate states (names of mesons) in 
each channel? Note that the “flavor flow” of both diagrams can be represented 
by a single diagram, by separating all pairs of intermediate lines to leave a 
square gap in the middle: This “duality diagram” represents the mesons as 
strings; the gap between quarks represents the “worldsheet” . 

Exercise VC9.2 
Consider quark-line notation for doing group theory in general: (Calculate 
using only graphs, with numerical factors — no explicit indices or Kronecker 
6’s, except for translating definitions. ) 

a Write the structure constants for U(N) as the difference of two diagrams, by 
considering a vertex of the form tr(A[B, C]). 

b Find pictorially the resulting expression for the Cartan metric (see subsection 
IB2): Show that it is the identity times 0 for the U(1) subgroup and 2N for 
the SU(N) subgroup (as found previously in subsection IIIC1). 

c Also use these diagrams to prove the Jacobi identity (i.e., find the resulting 
6 diagrams and show they cancel pairwise). 

d Derive diagrammatically the value of d;;, of exercise IB5.3b for the defining 
representation. 


Exercise VC9.3 


Consider the group theory factors for the above scalar theory, with only a 


370 V. QUANTIZATION 


cubic interaction. Draw all the 1PI 1-loop diagrams with 4 external double- 
lines, and rewrite the corresponding factors in terms of traces and products 
of 4 fields, including factors of N. (Be careful to include all permutations of 


connecting propagators to vertices. ) 


In general, we find that connected Feynman diagrams may include diagrams that 
are disconnected with respect to the above group-theory diagrams, where we consider 
the two group-theory lines on an external line to be connected. Such diagrams corre- 
spond to multiple traces: There is a factor of the form tr(G;G;...G;,,) corresponding 
to each connected group-theory graph, where 7, 7,...,/ are the group-theory indices 
of the external lines (actually double indices in the previous notation). However, all 
connected trees are group-theory connected. Furthermore, they are all “planar”: Any 
connected U(N) tree can be drawn with none of the quark lines crossing, and all the 
external lines on the outside of the diagram, if the external lines are “color-ordered” 
appropriately. Of course, one must sum over permutations of external lines in the 
T-matrix because of Bose symmetry. However, in calculating W[¢] one need consider 
only one such planar graph, with the external lines color-ordered; the Bose symmetry 
of the fields in W automatically incorporates the permutations. (Similar remarks 


apply to loop and nonplanar graphs.) 


As an example, consider the U(N) generalization of the 4-point tree example of 
subsection VC4. The Lagrangian is now (scaling the coupling back into just the 


vertex) 
L = tr{$[(0¢)? + m?¢"] + 390°} 
where the interaction term now has a combinatoric factor of 7 instead of 7 because 


it is symmetric only under cyclic permutations. The result for W is now modified to 


2 1 42 1 , 

W =—gtr [ de $6 Tee ao)” 
This analysis for U(N) can be generalized to SU(N) by including extra diagrams 
with lines inside propagators short-circuited, representing subtraction of traces. It can 
also be generalized to SO(N) and USp(2N): In those cases, antisymmetry or symmetry 
of the matrices means the lines no longer have arrows, and we include diagrams 
where the lines inside the propagators have been “twisted”, with signs appropriate 
to symmetrization or antisymmetrization. Generalization for all the above groups 
to include defining representations is also straightforward: Such fields, like the true 
quarks in QCD, carry only a single group-theory line. For example, sticking with 


our simpler scalar model, we can generalize to a scalar theory with the same group 
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theory as chromodynamics, with “free” quark fields, appearing as scalar fields in the 


defining representation of the group: 


LoL+tr[5(Oy)- (00) +0'f@ul, v= Upu;* 


where ~ is an NXM matrix with M flavors, and Uy is the flavor symmetry. (This 
color+flavor symmetry was treated in subsections IC4 and IVA4.) This field has a 
propagator with a single color line (with an arrow); however, we can also use another 
double-line notation, where ~ propagators carry one line for color and another for 
flavor. This method can be generalized to arbitrary representations obtained by 
direct products of defining representations, (anti)symmetrizations, and subtractions 
of traces, by giving the propagators the corresponding number of lines (though usually 


two lines are sufficient for the interesting cases). 
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_ VI. QUANTUM GAUGE THEORY _ 


We now consider special features of spin and gauge invariance, and introduce some 
special methods for dealing with them. In quantum theory, gauge fixing is necessary 
for functional integration: Gauge invariance says that the action is independent of 
some variable; integration over that variable would thus give infinity when evaluating 
amplitudes for gauge-invariant states. Eliminating that variable from the action (a 
“unitary gauge”) solves the problem, but not always in the most convenient way. (If 
it were, we wouldn’t have introduced such a redundant description in the first place.) 
Note that such infinities already appear for global symmetries: For example, the 
functional integral with wave function 1 (vacuum-to-vacuum amplitude) is infinite by 
translation invariance. This infinity is easier to understand for a nontrivial amplitude 
in momentum space, as a factor of a momentum-conservation 6-function (which is 
either co or 0, but necessarily co for the vacuum amplitude, which has vanishing 


momentum because the vacuum is translationally invariant). 


eet A. BECCHI-ROUET-STORA-TYUTIN ....... 


We have seen the relationship of gauge invariances to constraints in subsection 
IITA5. In this section we consider the quantization of constrained systems, and its 
application to gauge theories. The Becchi-Rouet-Stora-Tyutin (BRST) method is 
not only the most powerful, but also the easiest way to gauge fix: It replaces the 
gauge symmetry with an unphysical, fermionic, global symmetry that acts only on 


unphysical degrees of freedom. 


1. Hamiltonian 


Physical observables commute with the constraints. Thus, time development is 
described by the gauge-invariant Hamiltonian H,;, or we can set the gauge fields \' 


equal to some arbitrary functions of time f‘(t) as a “gauge choice”: 
N=f' => H=HAytf'G; 


In quantum mechanics, physical states should be annihilated by the constraints. How- 
ever, more generally we can require only that these states satisfy the constraints 


through expectation values: 
(¥|Gi|x) = 0 


This condition is satisfied by dividing up the constraints into: 
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(1) a subalgebra Go that annihilates all physical states, 


(2) complex “lowering operators” G_ that also annihilate these states, and are a 


representation of the subgroup generated by Go, and 
(3) their hermitian conjugate “raising operators” G, = G_t. 


(We treat +, —, and 0 here as multivalued indices.) Thus, 
(Go — const.)|p) = G_|p) = (W|(Go — const.) = (Y|G+ = 0 


where we have allowed for “normal-ordering” constants to be included as eigenvalues 
of (an Abelian subeset of) the Go constraints. In the Abelian case, it then follows 


that, although G do not annihilate these states, they generate gauge invariances: 
d\v) = Gy IC"), (Go — const.’)|¢*) = G_|¢*) = 0 


preserves the inner product of such states as well as the constraints on |q). This may 


include “residual gauge invariances” that survive in a solution to the constraints. 


Unfortunately, things get more complicated in the nonabelian case. For example, 


the gauge invariance and constraints above are no longer compatible in general: 
0 = G_d|b) = G_G4|¢7) = [G_, G4]|¢7) = -if_4*G41¢7) 40 


However, one example that doesn’t have this problem is the simple case where 
there are only 3 constraints, forming an SU(2) algebra (so f_4+ = 0): If we choose 
G_ to be the lowering operator and G', to be the raising operator, then the constant 
appearing for Go is simply the lowest eigenvalue in some irreducible representation in 
the Hilbert space |w), and the constraints pick out the corresponding state (or states, 


if there is more than one representation with that “spin” ). 


A convenient way to deal with this problem is to replace the nonabelian algebra 
G; with a single operator, which is therefore Abelian. We define a BRST operator Q 
that imposes all constraints G; by adding a classical anticommuting “ghost” variable 


c’, and its canonical conjugate b;, 
{bi, cl} = oH 


for each constraint: 
Q = eG: = igcic! fia" by 


The second term has been added to insure the Poisson bracket or commutator 


{Q,Q} = 0 
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so that its crossterm cancels the square of the first term, while its own square vanishes 
by the Jacobi identity Siig! fr” = 0. Quantum mechanically, the BRST operator is 
nilpotent: 

quantum mechanically {Q,Q} = 2Q? =0 


We can also describe the ghost dependence by the “ghost number” 
20h = [2,O)=0 


(Quantum mechanically, we need to normal order these expressions for Q and J.) 
Each anticommuting ghost and its conjugate will serve to “cancel” each commuting 
constraint and its conjugate gauge degree of freedom. (Similarly, bosonic ghosts are 


introduced for fermionic constraints, so each term in Q is fermionic.) 


The BRST operator provides a convenient method to treat more general gauges 
than 4 = f, such as ones where the gauge fields become dynamical, which will prove 
useful particularly in relativistic theories. Now the original physical observables A 


will satisfy 
[G;, A] = [b, A] =[c’, AJ =0 = [Q,A] =[J, A] =0 
and similarly the physical quantum mechanical states |~) will satisfy 
(Go, G_;bo,b-5e")|¥) =0 => Q|b) =Jly) =0 


where we have used the fact the only nonvanishing structure constants are foo’, fo.*, 


fo, fast, f--~, and f,—*. (In the quantum case there are also some subtleties 


due to normal ordering.) We also have the gauge invariances 


6A = {Q, A}, 4|~) = Q|A) 


for arbitrary operators A and (unrelated) states |\), since the Q-terms won’t con- 


tribute when evaluating matrix elements with states annihilated by Q: 


(Cal + (Ar]Q)(A + 1Q, A}) (l2) + Q]A2)) = (hi Alva) 


The gauge invariances are consistent with the constraints because of the nilpotence 
of the BRST operator. (So Q(|w) + Q]A)) still vanishes, etc.) States satisfying 


Q|v) =9, d/h) = QI) 


(i.e., we identify states that differ by Q on something) are said to be in the “coho- 
mology” of Q (“BRST cohomology”), and operators satisfying 


[Q, A] =0, dA = {Q, A} 
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are said to be in its “operator cohomology”. (The latter cohomology also has a 
classical analog.) 
Exercise VIA1.1 
Assume that each physical state can be represented as a physical observable 


(Hermitian operator) acting on a ground state, which is itself physical: 
lv) =A]0), A=Al,  QI0)=0 


Show how this relates the gauge parameters and cohomologies of |q) and Q. 
The BRST operator incorporates the ghosts that are necessary to generalize treat- 


ment of the constraints to the nonabelian case: For example, to reproduce the gauge 


transformations of the Abelian case, we choose 


Mahle oH 0 > de) =—QD)=—E,\C) 


where 
Gi = {Q,b} =Gi- ic! f ja* be 
are the “gauge-fixed” constraints, which include an extra term to transform the ghosts 


as the adjoint representation. They reduce to just G; in the Abelian case, but add 


ghost terms to the gauge transformation law otherwise. 


In particular, the Hamiltonian is a physical operator describing the energy and 


the time development, so we can write 

H=Hit{Q,A}, [Q,Hal=0 > T (et oe (“J ) + {Q,n} 

for some k. This includes gauge fixing for the gauge \’ = f’ discussed above, using: 
A= fi, > H=Hgt+f'Gi, L=—(G"Pm—itb)) +H 

The ghost terms in er only affect the time development of unphysical states in this 

gauge. 

For example, when calculating S-matrix elements, the result is independent. of 
the gauge choice A, as long as both the gauge-invariant Hamiltonian H,; and the 
states are BRST invariant. (H,; commutes with Q, the initial and final states are 
annihiliated by it.) It is also independent of the gauge choice |X) for |¢) — |w)+Q|A). 


(Such a “residual gauge invariance” persists even though the asymptotic states satisfy 


the free field equations.) 


In the cases of interest in relativistic physics, the constraints always consist of a 


linear term depending only on the canonical momenta p (conjugate to the fundamental 
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variables q), at least after some redefinitions, plus higher-order terms, which can be 
treated perturbatively. Therefore, as the simplest nontrivial example, we consider a 


model with a single variable g, with 


Agi = 0, G Pp > Q cp 


If we assume boundary conditions on the wave functions such that they can be Taylor 


expanded in q (they can always be expanded in c), we can write 


(oe) 


v(q) = (qv) — So (an ai CBn) 4a" 


n=0 


and similarly for A(q). We then examine 6|q) = Q|A), comparing terms with the 
same power of c and qg on both sides of the equation, to find da, and 6(,,. We then 
see that we can easily gauge (3, = 0 for all n by choosing certain coefficients in |A) 
to be —G, (so 6}, = Bx +68, = 0). Looking at Q|w) = 0, we then find that a, = 0 
for all n except n = 0, so only the constant piece of ~ survives. In other words, the 


cohomology is given by 


Qy) =0, dv) =Q)A) = pip) = |p) = 0 


So, solving for the cohomology of Q = cp is the same as solving the constraint py = 0 


without ghosts. 
Exercise VIA1.2 


Consider creation and annihilator operators satisfying 
[a, "| =[d, a") = {e,0'} = {b,c!} = 1 


(the other commutators vanishing): 


a Find the cohomology of the BRST operator 
Q=cal+cta 


by expanding in creation operators a‘, b', c!, d' about a vacuum state de- 
stroyed by the annihilation operators a, b, c, d. (This is the common alterna- 


tive to the boundary conditions used for Q = cp above.) 


b Compare the method of constraints used in the Abelian case: Ignore the 


fermions, and identify which bosons are constraints and how they are applied. 
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2. Lagrangian 


To obtain more interesting gauges we need some extra bosonic variables, such 
as the gauge fields \’ that we lost along the way, and their canonical conjugates 
(“Nakanishi-Lautrup fields”) B;, 


as well as their corresponding ghosts (“antighosts”) ¢;. We can do this in a trivial 


way by including constraints that set B to zero: 


Q = CG; — ihelc! fj*by + BB, J = c'b; — &b' 
where ¢; is conjugate to bi ; 
{é, '} = & 
As a simple example, consider 
A=} => {Q,A}=XG;, — id's, 
The action now includes the gauge fields and all the ghosts as dynamical variables: 
L = —(¢" Pm + 1B, — itd; — ibe) + Hy + {Q, A} 


For this gauge we can eliminate b and b by their equations of motion; assuming G; 
is only linear in p, we then can eliminate p to return completely to a Lagrangian 
formalism: 


L= Leg, X) a NB; = (Vic )G; 


where L,; represents the original gauge-invariant action (which depended on both q¢ 


and A, including time derivatives), and V; is the covariant (time) derivative: 
Vic = ou + OM fi," 


The gauge condition (from varying B) is now \= 0, generalizing the non-derivative 
gauges found without the antighosts and Nakanishi-Lautrup fields. Correspondingly, 


the ghost term is now second order in derivatives. 


Exercise VIA2.1 


Consider the general gauge choice 
A= Xb; + [F'(q,p) + E(B) é 
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where F" are some arbitrary functions of the original variables, and E’ are 
functions that effectively average over the types of gauges produced by F”. 
Find the gauge-fixed Hamiltonian and Lagrangian. In the case where F is 
linear in B, eliminate B, b, and 6 from the Lagrangian by their algebraic 


equations of motion. 


Now that we understand the principles, all these manipulations can be performed 
directly in the Lagrangian formalism. This will have the advantage that in field 
theory the Lagrangian is manifestly Lorentz covariant, while the Hamiltonian (or the 
Lagrangian in the Hamiltonian form —qp+ H) is not, because of the way it singles out 
time derivatives (and not spatial ones). (Consider, e.g., electromagnetism.) Similarly, 
neither the unitary gauge G; = 0 nor the temporal gauge ?; = 0 is usually Lorentz 
covariant. We can work with just the original variables g, A plus the new variables 
B,c,¢, and define Q by the transformation it induces (as derived from the Hamiltonian 


formalism): 
Qq” = big”, Qx = —i( +0 A* fx"), Qc a isc figs, Qé,=B;, QB;,=0 


where 6; is the gauge transformation induced by G; ([G;, | in the Hamiltonian for- 
malism). In deriving Q, we have used the equations of motion of p, b,b (which were 
eliminated). Note that the BRST transformations of the original variables are ex- 
actly the same as the gauge transformations, with the gauge parameters replaced 
with the corresponding ghosts. We can also consider the Nakanishi-Lautrup fields 
B as original variables, with the fact that they don’t occur explicitly implying they 
have constraints B = 0. Alternatively, we can treat the antighosts ¢ as pure gauge 
degrees of freedom, with their own nonderivative gauge transformation dé = \ that 
allows them to be completely gauged away. 


The Lagrangian can be gauge-fixed directly as 
L=Lg,+QAr 
where in the case just considered 
Ar = —\G; 


gives the same L as above for the \ = 0 gauge. In the simpler case described earlier 


(the gauge \ = function of t only) 
Ar = (XN — f& 
This gives the result, for the simplest choice f = 0, 
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after eliminating the Lagrange multipliers B and A by their algebraic equations of 
motion. Note that A, = \'é; corresponds to the Hamiltonian formalism’s A = 0. 
Thus, in the Hamiltonian formalism we never quantize with H = H,; + AG, but 
can use just H,; and A = 0, which is equivalent to using H,; + {Q, A} for any A, 
while in the Lagrangian formalism we can never quantize with just L,;(q, A), or even 
Lgi(g, 9), and Ay is never zero, but must be chosen so as to break the gauge invariance. 
However, the extra term for L,;(q, 0) is just the ¢b term found from converting H,; to 


the Lagrangian formalism. 


Exercise VIA2.2 
Repeat exercise VIA2.1 directly in the Lagrangian formalism. (Find Az, etc.) 


All our results for quantization apply equally well in the path-integral formalism, 
which can be applied to either the Hamiltonian or Lagrangian. (Of course, for field 
theory we will be interested in applying BRST to path integrals for Lagrangians.) 


We then evaluate matrix elements as 
A= [6 Wide 4. S=Sy+ QA, V=VitQdAv; QS = QUyi = 0 


S,; and W,; depend on just the physical fields (no ghosts), so they are gauge invariant 
as well as BRST invariant. For S-matrices, since VW is an asymptotic state, the BRST 
operator used for its constraint and gauge invariance can be reduced to its free part: 
Q then acts on only the gauge fields. The statement of gauge invariance of Wj; is then 
equivalent to the requirement that gauge fields appear in it only as their Abelian field 
strengths. For example, the usual gauge vector A describing electromagnetism ap- 
pears in single-particle factors in the wave functional (W[¢] = [| %[¢] as in subsection 
VC1) only as: 


[A] = (A*| Pa); dAg = —0,A, Ow, = DA\=0 


> 0 = 0% = (6A"|[a) = (Al|O%a) 


using Oo(A||w) = 0 (where the relativistic inner product ( || ) was defined in subsection 
VB2). The transversality of ¢_ is equivalent to coupling to the Abelian field strength, 


since 
Oe =0 > te=Pba > (Alba) = dF? | Iba) 


in terms of an antisymmetric-tensor external-line factor ap. 
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3. Particles 


We have seen that the relativistic particle (with or without spin) is a simple 
example of a contrained system. For the simplest case, spin 0, the BRST operator 


follows simply from the single constraint: 
Q = ¢5(O —m’) 


Unlike the nonrelativistic case, the relativistic “Hamiltonian” is identified with this 
constraint. Since we know constraints are treated by the BRST operator, we can 


consider writing the field theory action in terms of it: 


oo = - fav dc 5PQP 


Using the explicit c dependence of the field = @—icw, we find the usual scalar kinetic 
term. @ is thus the usual field, while w is an “antifield” , which has opposite statistics 
to @ (fermion instead of boson). We’ll see in chapter XII that Q can be constructed 
straightforwardly for arbitrary spin, and has a simple expression in term of generalized 
spin operators. (As in nonrelativistic theories, spin is easier to treat directly in 
quantum mechanics rather than by first-quantization of a classical system.) The 
kinetic term then generally can be written as a slight modification of the above. Then 
the antifields will be found to play a nontrivial function, rather than just automatically 


dropping out as in this case. 


From the constraints and their algebra for spin 1/2 (see also exercise IIIB1.2) we 
find the BRST and ghost-number operators: 


Q=c7(O-— m’) -—&(y-O-iB)-C b+ Eu, = T= eb + EC + EC 


where € and its conjugate ¢ are bosonic ghosts, and we have added a nonminimal 
term with boson € (conjugate ¢) and fermion ju (conjugate «) to allow gauges general 
enough for first-quantization: 


[¢, €] = IC, €] = {k, pu} =] 
For convenience, we also have chosen € (and ¢) to anticommute with 7, 
16.0} ={16 7} =0 


to avoid having to replace 15 with y_\2m; this has the natural interpretation of 
treating € and ¢ as bosonic (ghost) components of the y matrices (see subsections 
XITA4-5,B5). 
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Exercise VIA3.1 
Find Q and J for spin 1 as constructed from the direct product of 2 spin 1/2’s 
(see exercise IIB4.1d). 


Note that [Q,é] = 0, but {Q, A} # € for any A, so € is in the operator coho- 
mology of Q. Normally, this would imply infinite copies of the physical states in 
the cohomology, since applying a “translation” with the ghost variable € gives a new 
state in the cohomology from any given one. The nonminimal variables allow us to 
avoid this problem by combining € with € to produce harmonic oscillator creation 


and annihilation operators: 
f= =3(a +a"), _ <zi(a' —a), C= (4-4), C= —5i(@! + a) 
(a, @'] = [a,a'] =1, rest =0 
This allows us to define a ground state 
a|0) = aj0) = 0 


which breaks the translation symmetry of €. In chapter XII we’ll show in a more 


general framework how the 6Q@ type of action then reproduces the Dirac action. 


4. Fields 


As described in subsection VIA2, we can perform gauge fixing through BRST, 
including the introduction of ghosts, directly on the Lagrangian at the classical level. 
Also, the BRST transformations on the physical fields are just the gauge transforma- 
tions with the gauge parameters replaced by ghosts, and the BRST transformation on 
the ghosts is quadratic in ghosts times the structure constants, while on the antighosts 
it gives the Nakanishi-Lautrup fields, and it annihilates the NL fields. In the case of 
Yang-Mills we then have 


QA. =—[Va,C], QC =iC?, QC=-iB, QB=0 


while for matter transforming as 6¢ = iA\@ we have 


where we have used matrix notation for the group algebra, as usual. There are two 
minor differences from the transformation rules we used in our general discussion 


en 
a 


previously: (1) We have included an extra in our definition of the relativistic Q, 


for a convenience that will become apparent only when we relate relativistic first- and 
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second-quantization (see chapter XII). (2) There is a relative sign difference for QC 
because now Q is second-quantized while G’; is still first-quantized (i.e., matrices). 


More explicitly, we have, e.g., 
@PaHe,C]| = QC= —5 CIC" firs; Qo = iC'Gid 


The gauge-fixed action is then the gauge-invariant action plus the BRST trans- 


formation of some function A: 
Sof = Sa — IQA 


For example, consider Yang-Mills in the most common type of gauge, where some 


function of A is fixed: 


for some constant a. For a = 0, B is a Lagrange multiplier, enforcing the gauge 


f(A) = 0, while for a 4 0, we can eliminate B by its auxiliary field equation: 


Examples will be given in the following section. 


In field theory gauge-fixing functions always have linear terms, as do gauge trans- 
formations. Furthermore, there always exist “unitary gauges”, where no ghosts are 
required. The ghost terms in general gauges serve simply to provide the appropriate 
Jacobian factor for the field redefinition that transforms from the general gauge to 
the unitary gauge, which appears at the quantum level from functionally integrating 
out the ghosts. The simplest example is the trivial gauge invariance that occurs in 
the Stiickelberg model (subsection IVA5): 


QA=-0C, Q¢=mC, QC=-iB, QC=QB=0 
which we can fix with 
~iQ(CO¢) = —BO¢ + imCOC 


for some field-independent operator O. Functionally integrating out B still sets ¢@ = 0, 
but produces an inverse functional determinant of O (from redefinition of ¢, or from 
6(O@)), canceled by that from integrating out the ghosts. The advantage of BRST 


is that all this can be treated at the classical level, in terms of the classical action, 
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without regard to functional integration, while directly giving a solution that can be 
expressed immediately in terms of Feynman rules. 
Exercise VIA4.1 
Show that the gauge fixing 


~iQ(CO¢ + CAB + CB¢+ CCB) 


where O, A, B, andC are field-independent operators, gives a result equivalent 


to the previous, by considering functional determinants or field redefinitions. 


Exercise VIA4.2 
Show that the Lagrangian 


AAB+CBD — AABD 


by the field redefinition 
D—D+B'B 


for bosons A, B, C, D and operators A, B. This is the classical equivalent of 
det(AB) = det(A)det(B). 


These methods apply straightforwardly to supersymmetric theories in superspace: 
From the gauge transformations of subsection IVC4, 


Qe” = iCe” —ie"C; QC =iC?, QC =iC’ 
QC =-iB, QC=-iB; QB=QB=0 
where C, C, and B are chiral superfields, and C, C and B their hermitian conjugates. 


In practice, this BRST approach is sufficient for gauge fixing. In particular, this 
is true for the fundamental fields used in the standard model (including gravity), 
which have spin<2. Therefore, we’ll use mostly this approach in the rest of this text. 
However, some observed hadrons have much higher spin. The first-quantized approach 
of chapter XII gives a natural and direct way of understanding ghosts and BRST for 
the fields describing such particles, and translates directly into the treatment of Zinn- 
Justin, Batalin, and Vilkovisky (ZJBV) for field theory. 
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pecbole etna ercnrmetesaueids ice C7. 10 C2 |. eee enn reenesnemvedes 


There are two important properties of gauges we have examined: (1) Gauges 
which eliminate some degrees of freedom, such as lightcone or unitary gauges, are 
simpler classically, which makes them easier to understand physically. (2) Gauges that 
manifest as many global invariances as possible, such as the Fermi-Feynman gauge, 
will be found later to simplify quantum calcuations, because the explicit momentum 
dependence of the propagator or vertices is simpler, and keeping a symmetry manifest 
makes it unnecessary to check. In this section we’ll examine these gauges in greater 


detail, especially as they relate to intereacting theories. 


We'll study also some special gauges, with nontrivial interaction terms, that have 
both of these properties to some extent. In particular, they are manifestly Lorentz 


covariant, but avoid many of the complications associated with ghosts. 


1. Radial 


We know from nonrelativistic classical and quantum mechanics that the equations 
of motion can be solved exactly only for certain simple external field configurations. 
One particular case we have already emphasized is that of an action quadratic in the 
dynamical variables, i.e., the harmonic oscillator and its generalizations. Higher-order 
terms are then treated as perturbations about the exact solution. Such an expansion 
in the coordinates x is the particle version of the JWKB expansion in h: Calling the 
“classical” part of « “y”, we substitute 2 — y+ Vha and Taylor expand in x. (From 
now on we'll drop the h’s, and just remember to perturb about the quadratic terms. ) 
For the scalar field we write 


b> P+U-Ob + FUL" On Ond +». 


where O...0¢ is implicitly evaluated at y. 


For the gauge fields we would like to be a bit more clever: For example, for the 
electromagnetic potential A,, we know we can always add a constant, so A,,(y) is 
irrelevant, while for 0A only Finn = OmAnj is gauge invariant. This means we want to 
choose a gauge best suited to this calculation: a gauge that both eliminates as many 
as possible of the lower-order terms, and expresses A(y + x) in terms of only F'(y) 
and its derivatives. Similarly, we should have a Taylor expansion for charged fields 
in terms of covariant derivatives. The appropriate gauge, which easily can be found 
explicitly, is the “radial gauge” 


g- Aly +e) =0 
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(Note that, unlike F’, A depends on x independently of y, not just as y + 2, since the 
gauge condition itself is z-dependent. We write A(y + 2) only to indicate that A is 


evaluated at position y + x.) One way to solve this condition is to use the identity 
o" Fam = (£-O+1)Am — [Vm, 2° Al (0 = 0/02) 


which follows from the definition of F’. Using the gauge condition, we then can write 


1 


m= eee 
z-O+1- 


Alternatively, we can replace x everywhere (including the argument y+) by 7x, and 
then identify x -0=70/0r to find 


TE" Fel HTD) = OT Ay + Te) 


Integrating both sides over rt from 0 to 1, we find 


1 
Am(y + x) = | dt Tx" Fim(y + T2) 
0 


Note in particular that A(y) = 0. 


Another way to define this gauge is to consider gauge covariant translation from 
y to y+ 2x to produce a gauge transformation from an arbitrary gauge to the radial 


gauge. Writing the covariant derivative at y as 
D=D-+iA(y), D=d/0y 
we know from subsection IIIC2 that 
W (y +2) = e* uly) = ee? v(y) = ey + 2) 


so that covariant translation produces a w’(y +2) that is the same as (y+) up to 
a gauge transformation. Thus the gauge-transformed wy can be written as a covariant 


Taylor expansion (for purposes of perturbation) about y: 
Wey) = So ha™ ---2% (Da, --- Day ')(Y) 


In particular, u(y) = w(y). 
However, we want to define a covariant derivative with respect to x (not y), so 
that 


V=0+iA (ety), Vo'(yta) = (Db)'(y + 2) =e”? (Dy) (y) 
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Using 
Ov(y) = 0 
we find the solution 


V =e"? De®? mod & de?” 


where the latter term vanishes on w/(y + x), so the right amount of it can be added 


to the former expression to cancel any D terms. The result is 
V=e""(0+D)e*” 


This implies x - A’(y + x) = 0 directly: Contracting both sides with x, the Taylor 
expansion of the right-hand side terminates after the first couple of terms. Taylor 


expanding the uncontracted expression, we have 
Ay +2) =) aiaga(t: D)"2"Fealy) 


We can also write 
O+iA (e+y) HV Here FO + iAle tyler re *? =e“0+ 1A +y)le“ 


and 
V = eh Pe ¥ Deh e-FP 


Exercise VIB1.1 
Show this Taylor expansion is equivalent to that obtained from the first 
method used in this section to solve the gauge condition. (Hint: Look out for 
hidden x and y dependence — How does x: 0 on wy’ or F” relate to x - D? 
Also beware of notation: In the first construction we did not use a gauge 


transformation, so no primes were used.) 


Exercise VIB1.2 


Generalize this construction to superspace (see subsection IVC3): 


a First give an expression for the gauge potential A, in terms of covariant 
derivatives of field strengths F',p. 

b Then look at the expansion in just 6. Give the explicit result for the ex- 
pansions of Ag and A, about @ = 0, applying the constraints: This is the 
“Wess-Zumino gauge” (see exercise IVC4.2 for the Abelian case). 

Thus, to just quadratic order in x, the mechanics action for a relativistic particle 


in external fields (subsection IIIB3) becomes 


Sr & [ard Mma" + a ae” Frm (Y) 
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+v[9(y) +0" (Omd)(y) + 302" (Onn) (y)]} 


To this approximation the classical equations of motion are linear and can be solved 


exactly. It can also be used to find exact solutions for constant electromagnetic fields. 


2. Lorenz 


For purposes of explicit calculations in perturbation theory, it’s more convenient 
to use gauges where Lorentz covariance is manifest. “Lorenz gauges” are a class of 
gauges using 


f=0-A 


(and similarly for other gauge fields) as the gauge-fixing function. From the discussion 
of subsection VIA4, we have from the usual BRST as applied to Yang-Mills 


Lay = 1F2, — iQ 3[O(0- A+ }0B)] 
= -1A.01A-1(0. A)? — 4[A*, A’](-i0, An + 4Aa Av) —$BO-A—taB?-hiCd-[V,C] 
After eliminating B by its field equation, the kinetic terms are 


-1A-DA-4(0- A)? +24(0- A)? - HCOC 


4 


In particular, for a = 1 we have the “Fermi-Feynman” gauge, which gives the nicest 
propagators. (It is also the gauge that follows automatically from a first-quantized 
BRST construction, which will be described in chapter XII.) More generally, we find 
the propagator from inverting the kinetic operator: For the ghosts this is always 2/p’, 
but for Ag, 


2[n™p* + (= — 1)p*p')* = 2 a iat) ay 
For a = 0 this is the “Landau gauge”, which has the advantage that the propagator 
is proportional to the transverse projection operator. (It kills terms proportional to 
Da-) However, a = 1 is clearly the simplest, and the 1/p* term can cause problems in 


perturbation theory. 


Exercise VIB2.1 
In the Abelian case, consider making a gauge transformation on the gauge- 
fixed action (including matter), with \ ~ O7'B. Show that the only effect 
is to change the value of the coefficient a of the B? term. Find a similar 
transformation for the form of the action where B has been eliminated. This 


shows explicitly the decoupling of the longitudinal mode of the photon. 
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Exercise VIB2.2 
Show that for general A and B 


ab a,b oe = B 
(n A+ p*p’B) ~ A Tab ade OT, 


Note that in the Abelian case the lightcone gauge is a special case of the Landau 
gauge. (An analogous situation occurs in the classical mechanics of the particle for 


the gauges of the worldline metric, as discussed in subsection HIB2.) Here we have 
0 =n°(0°Fw) =n-O(0-A)—O(n- A) 


In the lightcone formalism, this is the field equation that comes from varying the 
auxiliary field. In the lightcone gauge n- A = 0, it implies 0- A = 0 (and thus also 


DA, = 0), since n- O is invertible. 


This is particularly useful in D=4, where we can generalize from the lightcone to 


a Lorentz-covariant form by using twistors: From subsection IIB6, 

Pat. > p™ scp pr, nr=0 => n%™* =e(n)nn* 
Massless spinors are described on shell in momentum space by (see subsection IIB7) 
yr=po, d= pd 
where external-line factors for Feynman diagrams are given by setting @ = 1. For 
massless vectors, we have p- A=n-A=0 (but n- p #0), so depending on whether 


the helicity is +1 (self-dual field strength) or —1 (anti-self-dual field strength), we 
find, respectively, 


- x ‘ ap _ 
tu = pp’, fe = ij ai Ace = np 
NY Py 
gs ; an 8 
few pp, = ft =0 > ABaEM 
Nps 


The normalization of A has been chosen compatible with |¢| = 1 and A*A*, = 1 for 
evaluating cross sections. In a general Landau gauge the arbitrary gauge-dependent 
polarization spinors n°, i can be chosen independently for each external line, since 
gauge invariance means independent gauge parameters for different momenta. (This 
method is known as “spinor helicity” .) However, in a lightcone gauge the polarization 


spinors are constant. 


The lightcone gauge condition is thus again a stronger gauge condition than 
Lorenz gauges, as expected from the fact that it has fewer derivatives. This difference 


shows itself in various ways: 
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(1) In perturbation theory on shell, in the lightcone frame the Landau gauge condition 
0=p:-A=-—p'A kills A~ but says nothing about At, which can be eliminated 


by the residual gauge invariance 6A* = p*) to obtain the lightcone gauge. 


(2) In perturbation theory off shell, more derivatives in the gauge transformation 
imply more derivatives in the ghost kinetic operator. Thus, more ghost degrees 
of freedom are introduced to cancel the extra unphysical degrees of freedom in 


the gauge field. 


(3) Lorenz gauges also have a nonperturbative ambiguity (the “Gribov ambiguity” ) 
that axial gauges avoid: Nonperturbative solutions to the gauge condition can be 
found that differ from the perturbative one, in the nonabelian case. Specifically, 


it is possible to find a nontrivial gauge transformation g (V’ = g-'Vg) such that 
0=0-A'=-id-g \(Vg) for 0:-A=0 
even when g is required to satisfy boundary conditions that it approach the 


identity at infinity (except in the Abelian case, where g = e* > OA =0=> A 


0). This is not the case for axial gauges, where 


O=n-A=n-g'(Vg) for n-A=0 => g'(n-dg)=0 => gH=I 


by simply integrating from infinity. 


3. Massive 


In subsection IIB4 we described the introduction of mass for the vector by dimen- 
sional reduction, giving the Stiickelberg formalism for a massive (Abelian) gauge field. 
The gauge-invariant action (subsection IVA5) and BRST transformation laws (sub- 
section VIA4) followed from adding an extra dimension and setting the corresponding 


component of the momentum equal to the mass: 
Lgi = 3 Fay + g(mAa + Oad)” 
QA,=-0.C, Q¢=mC, QC=-iB, QB=0 
where the scalar is the extra component of the vector. 


There are two obvious covariant gauges for such a vector: (1) The “unitary gauge” 
=o 


simply gauges away the scalar. Since the scalar has a nonderivative gauge transfor- 


mation, the ghosts do not propagate: The gauge-fixing term 


~iQ(C¢) = —B¢+imCC 
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simply eliminates the scalar and ghosts as auxiliary fields. The net result is that we 


could have simply chosen 
gauge ¢=0 
and ignored ghosts because of ¢’s nonderivative transformation law. Thus the gauge- 
fixed Lagrangian is just the result of adding a mass term to the massless Lagrangian: 
Lot = gly + em? A? 
But the propagator is 


Nab PaPb 


2[192 (2 2) _ tp |-1 = 9 
ne ae) = PP '| P+ mt mp? +m) 


Notice that the second term is higher in derivatives than the first; this can cause some 


technical problems in perturbation theory. 


(2) The Fermi-Feynman gauge works similarly to the massless case. We then 


modify the gauge-fixing function to 
f=0-A+mé¢ 
so 


~iQ[LC(O- A+m¢ + 4B)] = -1B(0-A+mo+4 4B) —-2iC(O—m’)C 


=> Lop = $F? + 4(mA + 00)? + ¢(0- A+ me)? — FiC(O - m?)C 
= -1A-(O-—m’*)A— 19(0- m?)6 -— FC (O — m’)C 
The propagators are again simpler. The vector has D propagating components instead 


of just the D—1 physical ones; the 2 ghosts cancel ¢ and the extra component in A. 
Exercise VIB3.1 


Generalize the Fermi-Feynman gauge for the Sttickelberg formalism to the 


“renormalizable gauges” with gauge-fixing function 
m 
i= Pid “A+ uo 


a Find the gauge-fixed action. 
b Show that the ghosts and ¢ have mass pu, while the vector propagator has the 


9 (ran + PoE) 1 Papo I 
oe m2 p? +m? m2 pe + 


form 
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This shows explicitly the second unphysical bosonic mode of mass jz to cancel 


the 2 ghosts, as well as the 3 transverse physical modes of mass m. 
c Look at the cases 
0 (Landau gauge) 
f= m_ (Fermi-Feynman gauge) 


oo §6(unitary gauge) 


These two choices of gauge also exist for Yang-Mills theories exhibiting the Higgs 
mechanism, since those models give the Stiickelberg model when linearized about 
the vacuum values of the fields. The advantages are the same: The unitary gauge 
eliminates as many unphysical degrees of freedom as possible (see subsection IVA6 


for an example), while the Fermi-Feynman gauge gives the simplest propagators. 


Exercise VIB3.2 
Work out the Fermi-Feynman gauge for an arbitrary Higgs model, generalizing 


the analysis for the Stiickelberg case. 


4. Gervais-Neveu 


We next consider pure Yang-Mills theory for the gauge group U(N), but use a 


complex gauge-fixing function 
fo a O . A + iA? 


where A, is a vector of hermitian NxN matrices, and A? = A*A,. (The hermitian 
conjugate, 7 — —i, gives similar results.) The gauge-fixed Lagrangian (in the action 
S=g*tr f L) is then 

La=¢F?+4fp =—74A- OA -iA°A0,A, — GA" ALAS 


~ 8 
(where O is the free D’Alembertian) while the ghost action can be written as 
Le = —MCV’C — 1CCfy 


where V acts on C as if it were in the defining representation (i.e, VC = 0C +iAC, 
not [A, C]). This “Gervais-Neveu gauge” already has the simplification that some of 


the terms in the Yang-Mills self-interaction have been canceled. 
Exercise VIB4.1 
Consider the “anti-Gervais-Neveu gauge”, where the same gauge-fixing term 


is added with opposite overall sign. 
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a Show the resulting Lagrangian can be written as 


La ~ tr[(@Atip’)’] 


where the trace is with respect to both (NxN) internal and (4x4) Dirac 
matrices. Thus, spin can be treated in a manner closely analogous to internal 
symmetry. 

b Show the propagator can be written in the form of the product of 2 (massless) 
Dirac-spinor propagators. 

c Starting with the complex first-order formulation of Yang-Mills of subsection 


IIIC4, show that the action can be written in a way that replaces the above 


4x4 matrices with 2x2 matrices, as 
La ~ tr[G? + G(OA* + iAA*)| 


in first-order form, where now G is neither traceless nor symmetric in spinor 


indices (its trace is the Nakanishi-Lautrup field), or in second-order form as 
La ~ tr[(OA* + iAA*)?] 


(Note that this differs from the above Dirac form, as expanded in 2x2 matri- 


ces, because it includes the Chern-Simons term.) 


Next, consider a model where the Yang-Mills fields couple to scalars that are also 
represented by NxN matrices, but that are in the defining (N-component) represen- 
tation of the gauge (“color”) U(N), while also being in the defining representation of 
a second, global (“flavor”) U(N). (See subsection IVA6.) This complex field thus has 
2N? real components compared to the N? gauge vectors, and the 2N? ghosts. We also 
choose a Higgs potential such that the masses of the scalar and vector come out the 
same (but we can also specialize to the massless case). The scalar Lagrangian is then 
(again with g~tr in the action) 


Lg=-36'V64+1R?, R= $6 — bm? 
Finally, we modify the gauge-fixing function to 
f=fotik 


With this choice, the ghost terms are unmodified (R is gauge invariant), but the 
scalar self-interaction is completely canceled (including the mass term). The total 


Lagrangian is then 


L= (Lat im?A’) + (-46'V7o + kolo fo) + i(-4CV7C + i8CC fo) 
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Since the scalar Lagrangian is identical in form to that of the ghosts, and neither 
has self-interactions, functional integration over them will produce canceling func- 
tional determinants, because they have opposite statistics. This is a reflection of the 
fact that both sets of fields now describe unphysical polarizations, since both describe 
massless states in a theory where all physical states are massive (as seen, e.g., in a 
unitary gauge). This has the great advantage that, for this particular model, both 
the scalar fields and the ghosts can be dropped altogether, while the Lagrangian 


L = La + +m? A? 


completely describes the physical massive vector and scalar states. This was possible 
only because of the use of a complex gauge condition: The longitudinal component of 
the vector is now imaginary, which fixes the wrong sign associated with the Minkowski 
metric. A related characteristic of this gauge is that we nowhere needed to change 
the vacuum value of any field, unlike other gauges for actions where there is a Higgs 
effect. 


We now note that this result for the massive case (and its massless limit) actually 
can be obtained more easily than the result for the pure Yang-Mills case: Since the 
final result has no ghosts, it is in a unitary gauge, where the vector not only “eats” 
the usual compensating scalar, but “overeats” by absorbing the physical scalar. The 


appropriate gauge condition is still complex and involves the scalars, but is now linear: 


gauge $= (b) = ayml 


where ¢', treated as independent, is unfixed. (As for the usual unitary gauge Im ¢ = 
0, i.ec., 6 = o', there are no propagating ghosts, since the gauge transformation of ¢ 
has no derivatives.) In this gauge the action becomes quadratic in ¢": 


Ly + —$61i( A + iA?) tom + 4(6! Spm — $m’)? 


In fact, d' appears as an auxiliary field (taking the place of the Nakanishi-Lautrup 
field), so we can eliminate it by its equation of motion: 
é6 
dt 


This procedure is analogous to that used for the lightcone gauge, where one compo- 


= = 34+¥Bifp > L=Lyt+jm?h 


nent of the gauge field is fixed and one is eliminated as an auxiliary field: A closer 


analogy will be found in subsection VIB6. 


Of course, such gauges generalize to other Higgs models, but results will not be 


as simple when the vector and scalar masses differ: 
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Exercise VIB4.2 
Make the coefficient of the R? term in Lg arbitrary, so the masses of the 
vector and scalar are unequal, but choose the same gauge ¢ = (¢). Find the 


propagator, and compare with that of exercise VIB3.1. 


5. Super Gervais-Neveu 


Nonhermitian gauges are also useful in supersymmetric theories: Here we consider 
the supersymmetric analog of the massive model of the previous section. Although 
we work in N=1 superspace, the model turns out to automatically have an N=2 
supersymmetry. Just as the bosonic model ended with only a vector field describing 
only physical polarizations, we now want a real scalar superfield to have only physical 
polarizations. Since such a superfield has 8 bosonic components and 8 fermionic, while 
massless N=1 multiplets have 2+2 physical polarizations, we need 1 vector multiplet 
plus 3 scalar multiplets. Since the bosonic model had a complex scalar representation, 
2 of these scalar multiplets must form the analogous defining ® defining representation 
of local @ global groups, so the last must be a real (adjoint) representation of the 


local group. The model is then given by (where again S = g~*tr f dx L) 


Le = a a Ww? - [es (e~Y doe’ bo + b4e" 64 + o-e "G_) 


= / 20 (b,.0_ — 4m?) do + hic. 


where we have included the only possible scale-invariant potential term, and intro- 
duced a Higgs mechanism by an N=2 Fayet-Iliopoulos term, which we chose to write 


in terms of the chiral scalar. (See subsection [VC7.) 


The BRST transformations (which also imply the gauge transformations) for this 
action are (see subsection VIA4) 


Qe” = iCe” — ie" C, Qe" =—-e-"(Qe" )e-” =iCe-" —ie "C 
Qc =iC?, QC=iC?; QC=-iB, QC=-iB; QB=QB=0 
Qo4=1Co4, Qb0=i[C,¢],  Q¢- =—ib_C 
Qo4 = —i9,C, Qoo = i[C, 60], Qo_ = iCo_ 

Our nonhermitian choice for the BRST gauge-fixing function is 


A=-— / a6 C(Pe-’ + bo) — / 20 C(deY + Go) 
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Note that e” is an element of the algebra as well as a “nonunitary element” of the 
group, only because we chose the group U(N) (as was the case for A? in the bosonic 


version). The gauge-fixing and ghost terms are then 
—iQA= [0 BP + 60) + | 8 Bee” + 6) 
= ; (Ce "Ceca Cc) 4 / 26 CCoo + i a0 CCbo 


where we have used the field equation enforced by the Lagrange multipliers B and 
B (or, equivalently, made field redefinitions of the Lagrange multipliers to generate 


terms proportional to their constraints). 


Exercise VIB5.1 


Make a component analysis of this theory: 
a Expand the gauge-invariant action in components. 
b Do the same for the gauge-fixing terms. 


c Compare the bosonic part of both the gauge-invariant and gauge-fixed actions 


to those of the previous subsection, after elimination of auxiliary fields. 


We now see that the ghost terms are identical in form to those for ¢4, under the 


identification ; 
(4, @., bx, b_) a (C, C, C, C) 


(but beware signs from ordering of ghosts). So again the ghosts cancel the (N=2) 
matter fields, leaving only the N=2 vector multiplet. But we can also eliminate 
the N=1 matter half of this N=2 multiplet using the Nakanishi-Lautrup Lagrange 


multiplers: The final simple result for the gauge-fixed action is thus 
L=- / d’?9 W? — [eo [e~Y (d’eV JeV d?eY + 4m7(eY +e7")| 
A further simplification results from the redefinition (again possible only for U(N)) 
eX 31+V 
This also simplifies the BRST (and gauge) transformation for V: 
QV =i(C —C)+i(CV - VC) 


whose linear form resembles the bosonic case. Using the expression (see exercise 
IVC4.1) 


1 
4 = —id@e "de" 10 ——4, 
W. id-e "dge’ — —1 TV V 
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for the field strength, the Lagrangian becomes 


1 1 1 1 
L=— | do |-1—_(a¢v)#@——4,V + ——(#V) (14+ V)e&—_— 
/ | ay eye ae ey 


1 
+im? (v + sv) 


Although the nonabelian vector multiplet has nonpolynomial self-interactions in 
any gauge, this gauge simplifies the lower-point interactions, which are the ones more 
frequently used for a fixed number of external lines. Expanding this action to cubic 


order, we use the identity 
d°#d,, = d*d’dy = —40+ {d?, d?} 
for the kinetic term, and 
dyd? = d?dg t+ ida4d° 
for the gauge-fixing part of the cubic term, with integration by parts. (For the gauge- 
invariant term, some work can be saved by using the equivalent w form.) The result 
is 
C= / d‘6 {4V(O— mV + [4m?V3 + (d2V)Vidgyad VV] + O(V4)} 
Not only are there fewer terms than with linear gauge conditions, but these terms 
have fewer spinor derivatives, which yields fewer nonvanishing contributions in loops 
(see subsection VIC5). As for the bosonic model of the previous subsection, this 
analysis also applies for the unbroken case m = 0. 
Exercise VIB5.2 
Find the corresponding form of the kinetic and cubic terms without the re- 
definition e” > 1+4+V. 
Exercise VIB5.3 
Gauge fix by using the unitary gauge 


to obtain the same result. 


Exercise VIB5.4 
Look at the super anti-Gervais-Neveu gauge, or super anti-Fermi-Feynman 
gauge, changing the sign of the gauge-fixing term for the vector multiplet (see 
exercise VIB4.1). 


a Show that in the massless case the kinetic operator becomes, instead of H, 


Kw dh = FeO dadgdyds 
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where we now use 4-component spinor indices. 


b Show that the resulting propagator is of the form, in supercoordinate space, 
d‘ 
A(x, 6; 2", 6’) ~ ape — 0')6*(2 — 2’) ~ In[(a — 2! — i5070')?| 


where “x —2! —i3070"” (see subsection IIC2) is the supersymmetry invariant. 
(Hint: Use d* = f[ d*¢ eS"4*. Warning: If derived by Fourier transformation, 


the integral is infrared divergent, and requires dropping an infinite constant.) 


6. Spacecone 


We have just seen that gauge independence allows complex gauge conditions, 
which make the action complex. (In subsection HIC4, we also used complex auxiliary 
fields, with a similar effect.) In this subsection we introduce a complex analog of the 
lightcone, the “spacecone”, which will greatly simplify Feynman diagram calculations 


with massless fields. The spacecone gauge condition is 
A? iA? =0 


or more generally 
n-A=O, v=, n-n* >0 


(but only n*, not n**, appears in the action). While this gauge is spacelike (in the 
sense that only spatial components of the gauge field are fixed), it is also null, by 
virtue of being complex. Thus, although algebraically like the lightcone, it allows 
canonical quantization with the usual time coordinate. In fact, it is just a Wick 


rotation of the lightcone. We then eliminate A? + 7A? as an auxiliary field. 


The spacecone is a new gauge to add to our list of axial gauges n- A = 0 from 
subsection IIIC2, and the related gauges for scalars from subsections IVA5-6, VIB3-4: 


[asta gauges [__voverall [null (4 euriry File) 


(partly) temporal timelike : A° =0 lightcone: At =0, 6/6A7 


spacelike Arnowitt-Fickler : A! =0 spacecone: A'=0, 6/d5A* 
scalar unitary: ¢= ¢'| Gervais-Neveu: = (¢), 6/d¢' 


In fact, at least for the free theories, the gauges for the scalars can be considered 
as dimensional reductions (from 1 or 2 extra dimensions) of those for the vector, as 
used for deriving the Stiickelberg formalism in subsection IIB4, where the spacelike 
components of the vector associated with gauge fixing become scalars: Arnowitt- 


Fickler — unitary, spacecone — Gervais-Neveu. 
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The main advantages of the spacecone over the lightcone are special to D=4, so 
we now review the lightcone in a way specialized to physical spacetime. We first 
repeat the results of subsection IIIC2, relabeling the indices appropriately. Starting 


with the gauge condition (see subsections IA4 and IIA3 for notation) 
A‘ =0 
and eliminating A! by its field equation, the Lagrangian for pure Yang-Mills becomes 


Le Atatat Aq _ (Ft)? ih. 1( pret)? 


F*- =0tA~ -—@- At +i[At, A] 


t =. = all \ | 
FY=@ AT+ATA + i= ([A", 0A ] + [A7, 0A‘) 


We simplify the Lagrangian by using the self-dual and anti-self-dual combinations: 


Dropping also the ¢ superscripts on O for simplicity, 


Fr _ i(Fet Ft-) = ot AF + i=[At, OA] 
L=A*t0OA- + FtF 

— + 
= AtIOMA- -3 (Fa4*) [At, 0A] -i (Sa) [A-, OAT] 


0 


+ [A*, 0A ]—[A, 0A] 


1 
fa 
Exercise VIB6.1 
Label all the fields and derivatives in the above forms of the Lagrangian, F 
in particular, in spinor notation. 
Exercise VIB6.2 
Show that the field redefinitions A+ — (0)*'¢*, when applied to just the first 


two terms of the above Lagrangian, produce a local action describing the self- 


dual field equations of the lightcone formalism of subsection IIIC5 (taking into 
account the difference between the lightcone and spacecone). Compare the 
results of exercise IIIC5.2. Thus, by treating the latter two terms separately 
from the former two, Yang-Mills can be treated as a perturbation about self- 
dual Yang-Mills. 


Another simplification for massless D=4, and closely related to the use of helicity, 
is twistors. For our Feynman diagram calculations for spins < 1, almost all spinor 
algebra involves objects carrying at most two spinor indices (spinors, vectors, self-dual 


tensors), so we use the twistor matrix notation of subsection IIB6. In particular, in a 
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general class of gauges the external line factors for Yang-Mills fields in this notation 
(see subsection VIB2) read 


= le) {| a = 
per ea => f*=ilpl[pl, f=0 
for + helicity or 
Ip) lel 
A — = — — * — 
a = f = ip) (pl, 0 


for —, where (€1)* are the polarization 4-vectors for helicity +1 in terms of a twistor 


© €* which can vary from line to line, and whose choice defines the gauge, as a special 


€ 
case of the Landau gauge. (Positive helicity is the same as self-duality, negative is 
anti-self-dual. The Landau gauge condition is generally applied in arbitrary Lorenz 


gauges to external lines, to eliminate the redundant longitudinal degrees of freedom.) 


One special case is the lightcone gauge 


in terms of an arbitrary constant lightlike vector n. A more convenient gauge is the 


spacecone gauge, which can be written in terms of two twistors: 


n= lex)le| 


These two twistors are sufficient to define a complete reference frame: We can convert 


all spinor indices into this basis, as 


yy — wr e4° 


etc. This corresponds to using two lightlike vectors to define the spacecone gauge, 


n= = |e,)[ex|. For simplicity, we write |e,) = |); then a vector in this basis can be 


n 


written as 
p=p'|+)[+|+p |-)[-|+p'l-)414+ 21+)(-| 


if we use the normalization 


E.g., for massless momentum p = |p)[p], 


pt =(p—)[-p], po =(+p)[pt], p= (tp)[-p], 2° = (p—) [pt] 


We will also drop the superscript t in contexts where there is no ambiguity. This 


basis is related to our previous spinor basis up to phase factors, |+) ~ |+), |] ~ |.], 
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and we assume them to be commuting (rather than anticommuting); these changes 


are more convenient for dealing with twistors (commuting spinors). 


The advantage of the spacecone is that we can Lorentz covariantize the Feyn- 
man rules by identifying these two lightlike vectors with physical on-shell massless 
momenta. We need two such “reference” vectors because we are not allowed to have 
n = pon any line. Since only |+) appears in the external line factors for helicity 
+1, and only |—] in those for —1, the simplest choice is to pick the momentum of 
one external line with helicity +1 to define |—] for all lines with helicity —1, and pick 
the momentum of one line with helicity —1 to define |+) for lines with helicity +1. 
(In the presence of massless external spinors, we can also choose a helicity +1/2 line 
to define |—], etc.) Our above normalization means that we have chosen the phase 
(+—)/[-+] = 1 as allowed by the usual ambiguity of twistor phases, while our choice 
of the magnitude (+—)[—+] = —(+][+]-|—)|-] = 1 is a choice of (mass) units. In 
explicit calculations, we restore generality (in particular, to allow momentum integra- 
tion) by inserting appropriate powers of (+—) and [—+] at the end of the calculations, 
as determined by simple dimensional and helicity analysis. (This avoids a clutter of 
normalization factors \/(+—)|—+] at intermediate stages.) For example, looking at 
the form of the usual spinor helicity external line factors, and counting momenta in 
the usual Feynman rules, we see that any tree amplitude (or individual graph) in pure 


Yang-Mills must go as 
( el oo 


where F+4 is the number of external lines with helicity +. 


We now return to external line factors. The naive factors for the above Lagrangian 
are 1, since the kinetic term resembles that of a scalar. However, this would lead to 
unusual normalization factors in probabilities, which are not obvious in this complex 
gauge. Therefore, we determine external line factors from the earlier spinor helicity 


expressions for external 4-vectors. In Lorenz gauges (€4)* would be the polarization 


for helicity +1 for the complete 4-vector, but in the spacecone formalism only A* 


appear. Furthermore, in the spacecone we find 


: +) Lol 
a) =-H)i|.-—“ =0 
(4 = “HLH 
IP) =| 
(Ey Se =) =0 
[-p| 
since by antisymmetry (++) = [——] = 0, so that At carries only helicity +1 and 


A~ only —1. (This statement has literal meaning only on shell, but we can make this 
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convenient identification more general by using it as a definition of helicity off shell.) 


The appropriate external line factors for these fields are thus 


+ jy). lel _ Eel 
(PEL _ (+) 


Note that these factors are inverses of each other, consistent with leaving invariant 


(the inner product defined by) the kinetic term. 


An exception is the external line factors for the reference momenta themselves, 
where |p) = |-F) for helicity + gives vanishing results. However, examination of the 
Lagrangian shows this zero can be canceled by a 1/0 in a vertex, since p = p = 0 for 
the reference momenta by definition. (Such cancellations occur automatically from 
field redefinitions in the lightcone formulation of the self-dual theory.) The actual 


expressions we want to evaluate, before choosing the reference lines, are then 


ps ~ (+p)[=p] Fel Fe 
Evaluating the former at |p) = |—) and the latter at |p) = |+), we get 1 in both cases. 
In summary, for reference lines: (1) use only the 3-point vertex of the corresponding 
self-duality (+ + + for helicity +), and use only the term associating the singular 
factor with the reference line (the other term and the other vertices give vanishing 
contributions); (2) including the momentum factors on that line from the vertex, the 


external line factor is 1. 


7. Superspacecone 


To generalize these results to high-energy (massless) QCD, we consider supersym- 
metric QCD, i.e., Yang-Mills coupled to massless fermions in the adjoint representa- 
tion. For tree graphs, this is equivalent to ordinary massless QCD except for group 
theory, which can be evaluated separately. We first apply the spacecone approach to 
the component action for supersymmetric QCD: The modification of this action for 
ordinary massless QCD is trivial (replacing the adjoint quark current with defining). 
From this we derive the “superspacecone” formalism, rewriting the action more sim- 
ply in terms of spacecone superfields. (This form can also be derived from the usual 


superspace, but we will not consider that here.) 
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We now combine the spacecone approach to pure Yang-Mills of subsection VIB6 
with the spacecone version of the lightcone treatment of the massless spinor in sub- 
section IIIC2. We modify the lightcone to the spacecone for the quarks by instead 
eliminating wy? and pe as auxiliary. For later convenience, we also write the remaining 
fermionic fields as 

Pov, yao 
Then we directly find the terms in the Lagrangian 


L= AtO0A~ + FtF- + it (8 - V-AV*) 


F* = OF AT — L([A*, i AF] + (bt, d-}) 


where the quark term in F* comes from the quark coupling to A’ when using its 


equation of motion to solve for F. Collecting terms, we have 
vf = Ly ++ Ls -++ I4 
1 
i ni | 
Ly = AT$OA a ener 
—i 


ba = (Sat) (a io] + (ot 0) + (Fu) 407 


Oo? 


where terms with + have only a single sum over it. Although this Lagrangian is 


La = —(A*, 0A) Hw ew P(A, iY HU oY [At ol 


much messier than the original covariant one, one again saves work by expanding 


terms once in the action rather than repeatedly for each Feynman diagram. 


External-line factors for the spinors follow from the covariant ones of subsection 
VIB2 as they did for the spacecone vectors of subsection VIB6. We thus have 


gr =(-p], pw =(+p) 
Compared with those for A*, we see tw has an extra factor of p = (+p)[—p] as 


compared with AtA7, as expected from the extra factor of 1/(—i0) in the kinetic 


operator. Similarly, if we choose to use external quark lines as reference lines, we use 
- + 
Pog Dis ps 
Dp Dp 
which also reduce to 1 for the appropriate reference momenta. 


Noting that the bosonic and fermionic terms are the same except for factors of 
—iO, we can combine them into chiral superfields that depend on only two anticom- 
muting coordinates, + and 67 (really 6® and 6°): 


= a tr fae do*dd L, [eorae- =f Or —U ha, {d,,d_} =-id 
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dio* = 0; @ |=A ’ di@ |= 


(no sum on +). These spinor derivatives (and their corresponding supersymmetry 
generators) describe only spatial supersymmetry, since they contain no time deriva- 


tives. Then, using the identity 


ds.d_[~, 6*]| = [A™, -i0A™] + {y*, b} 


we easily combine the terms in the Lagrangian L into the superspacecone Lagrangian 


L: 


40 OF = 1 
_ 2 = cae + is 
The last term can also be written as 
d.d_ 
—|¢r, d | = [o*, ¢ | 


Exercise VIB7.1 
Introduce another pair of chiral superfields as auxiliary. Show the above 
£ then can be rewritten in local form, with no spinor derivatives, where the 
kinetic term resembles the covariant one for a massless spinor, while the inter- 


action term contains no derivatives and is only cubic. (Hint: did+/(—i0) are 


projection operators for chiral superfields.) Thus this Lagrangian resembles 
the Chern-Simons one that appears on the 3D boundary for the topological 
term in Yang-Mills (see subsection IIIC6). Expand the action in components, 


and separate out the pure Yang-Mills part. 
Exercise VIB7.2 


Repeat exercise VIB6.2 to obtain the superspacecone action for selfdual super 
Yang-Mills, quadratic in @* and linear in ¢~. Use the field redefinition 


g dye, dy =0 


and integrate the action over just 0~ (by acting with d_) to obtain a “chiral” 
action, with no spinor derivatives, and superfields that are functions of just 
6* integrated over just 0*. After further redefinitions as in VIB6.2, obtain an 
action identical to the nonsupersymmetric one obtained there, except for the 


J d0*. Expand in components, and relate to the nonsupersymmetric case. 
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8. Background-field 


A more general type of gauge choice is the background field gauge. As we saw 
in subsection VC1, the generating functional can be written in a form where the 
quantum field is expanded about a background field in the interaction part of the 


classical action. The basic steps of the background field gauge method are: 
(1) choose gauge fixing that is gauge invariant in the background gauge field, 


(2) show that the quantum/background splitting of the entire gauge-invariant action 


is also gauge invariant in the background gauge field, and 


(3) show that the effect of splitting the kinetic term in the gauge-invariant action can 


be neglected. (Only the interaction terms should have been split.) 
The result is then that the effective action I’, which depends only on the background 


fields, is gauge invariant in them. This gauge invariance is a strong condition which 
not only simplifies the effective action but allows a “background gauge” to be chosen 
for it that is independent of the “quantum gauge” applied to the path integral: The 
background fields and quantum fields can be in different gauges. For example, for a 
relativistic treatment of the quantum corrections to bound states whose constitutents 
are nonrelativistic (such as the hydrogen atom), it is convenient to use a Fermi- 
Feynman gauge (convenient for relativistic matter coupling to electromagnetism or 
chromodynamics) for the quantum fields and a Coulomb gauge (convenient for static 


or nonrelativistic matter) for the background fields. 
A simple way to formulate the background expansion is in terms of the covariant 
derivative: 


A> A=A+A 3S VOD+HiIA, D=A+iA 


where A is the quantum field (the variable of path integration) and D is the “back- 
ground covariant derivative” in terms of the background field A. We then find for 
the field strength 


Fy > —i[D, + 1Aq, Dy + 2As] = Fob + Dia Asy + i[Aa, Ao] 


and similarly for the action. Matter fields are split as usual, 


o> b=" +9 


We now have two gauge invariances, corresponding to the two gauge fields. Both 


transformations are defined to have the same, usual form on V = D+ 7A (and on 
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@ = y+ ¢), and thus both leave the action inert, but (1) the “background gauge 


invariance” is defined to transform the background fields covariantly 
background: = Di =eDe (y= ey), Vi=e®Ve® (6! =e) 
= A’ = e Ae” (¢' = e¢) 


and thus the quantum field transforms as a matter (non-gauge) field, while (2) the 


“quantum gauge invariance” is defined to leave the background fields inert 
quantum : Dap). (og =e, VY =e°Ve” (¢' =e) 
=> A=e((iD+ Ale) [pv =e*(~+ 9) - 9] 
The latter then determines the new BRST transformations 


QA, =—[D,+iA,,C], QC =iC?, QC=-iB, QB=0 [Q¢=iC(y+¢) 


The key to the background field gauge is to break the quantum invariance, so 
a propagator can be defined, but preserve the background invariance, so the path 
integral is gauge invariant. Since @, the BRST operator for the quantum gauge 
invariance, is now background gauge invariant, we need only choose a gauge-fixing 
function A that is also background gauge invariant. Many gauges are possible: The 
basic rule is to modify any normal gauge condition simply by replacing any partial 
derivatives O with background covariant derivatives D. For example, for a Lorentz 
covariant gauge 

0-A-—D-A 


We then gauge fix in the usual way, and now the gauge-fixing terms and the ghost 
terms are background gauge invariant, as long as we define the ghosts to transform 


covariantly: 
background : C! =e?Ce, CaeCe™, B’=e*Be™ 


For example, for Lorenz gauges the ghost term is modified, by the modification of the 


gauge condition and the quantum BRST transformation, as 
CO-VOSCD'C4CD AC) 


Furthermore, even axial gauges are modified: For example, even though the gauge 
condition Ag = 0 allows elimination of a component of the quantum field, it doesn’t 


affect the background field, which now appears in the ghost Lagrangian 


CHC ed CDoC 
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Since the S-matrix is gauge-independent (when BRST is used to perform gauge 
fixing, as we have), we can use the background field gauge version of the generating 


functional (now using ¢ to represent all quantum fields and vy all background), 
Zi¢|= f Dee, S$ = Sold] + Silo + ¥] — iQAld, y] = $ — (Sold + ¥] — Sol¢l) 


S=S[dt+y]-iQA, —S[d] = Sold] + Sil] 


where S[@] is the original gauge-invariant action, QA is the gauge-fixing as de- 
scribed above, and § is the sum of this gauge fixing and the background-expanded 
gauge-invariant action. We have thus separated the total action S appearing in the 
background-gauge-fixed generating functional into the background-gauge-invariant 


part S minus the noninvariant terms So [6 + y] — Sold]. 
As usual, the classical part of the effective action I'[y] is given by adding the 


kinetic term So[y] of the (gauge-invariant) classical action to the 1PI tree graphs, 
which are just the vertices for the background fields. (The Q term doesn’t contribute 


because it has no pure background piece.) Thus, 
Tetass(%] = S|e-0 + Solv] = S|g—0 = Sly] 


We now note that, as far as calculating just the effective action is concerned, 
we can drop all terms in the gauge-fixed action independent of or linear in ¢: Any 
independent term contributes only classically; any linear term will generate one- 
particle reducible graphs (“tadpoles”). This means we can drop the noninvariant 
terms Sol¢ + y] — Sold] from S. Thus, the Feynman rules for calculating I’ are: 
(1) Use the classical gauge-invariant action S|y] for the classical contribution to I; 
and (2) for the quantum contribution, use all the 1PI loop graphs coming from S. 


The result is background gauge invariant, since S is manifestly so. 


Another important feature of the quantum-gauge-fixed background field action is 
that it is background-gauge-invariant order-by-order in the quantum fields. In fact, 
every term in the corresponding ordinary gauge action has been replaced by one (or 


more, if there are F terms) background-gauge-invariant term. 


Exercise VIB8.1 
Consider the Fermi-Feynman background-field gauge for the quantum field 
of pure Yang-Mills theory. Write all terms (both gauge-invariant and gauge- 


fixed) quadratic in the quantum field. Show that these combine as 


—1A-OA — it A"|Fop, A” 
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where 0) = (D)? (and “DA” means “[D, A]”, etc.). 


Since all external lines are associated with background fields, if we draw graphs 
in such a way as to exhibit only the quantum fields, they will all look like vacuum 
graphs: graphs with no external lines. However, any particular such vacuum graph 
will represent many of the original graphs, since the background lines can be attached 
in many ways. Furthermore, in background field gauges any such vacuum graph, con- 
sidered as a contribution to the effective action, will be gauge invariant with respect 
to the background gauge transformations, since it results from the non-background 
gauge true vacuum graph by the replacement of the ordinary derivative with the 
background covariant derivative 0 — D (plus perhaps some noniminimal F terms), 
including in the propagator. In particular, the complete one-loop contribution to I’ 
is given by the vacuum graph with no quantum interactions: It can be obtained from 


just the part of the S that is quadratic in the quantum fields. 


D = Re 


Ko 


Exercise VIB8.2 
Consider an arbitrary gauge-invariant Yang-Mills action S [A] with A= A+A 
in terms of the background field A and quantum field A. Taylor expand the 


action in A as 


The infinitesimal quantum gauge transformation mixes different-order terms 
in the expansion. Show that the term quadratic in A is invariant under an 
Abelian quantum gauge transformation only if the background satisfies the 
field equations, dS{A]/6A = 0. Similar remarks apply to BRST transforma- 
tions and the gauge-fixed action. (Since quadratic actions, even in background 
fields, yield only a propagator, they can be described by first-quantization: 
Thus gauge invariance implying background field equations occurs whenever 
a gauge field appears as both a quantum mechanical state and a background 
field, for example in string theory. See subsection XIIB7 for a simpler exam- 
ple.) 

The S-matrix is then given in the usual way from I[y], after adding another 


gauge-fixing term for the background gauge invariance. Since the total S-matrix is 
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given by just the trees following from treating I" as a classical action, we need only 
a gauge-fixing term for the physical fields, and we can ignore background ghosts. 
(Of course, quantum ghosts were already used to calculate I.) This background 
gauge fixing is independent of the quantum gauge fixing. In particular, we can choose 
different quantum and background gauges: For example, when treating spontaneously 
broken gauge theories, it’s often more convenient to choose a Fermi-Feynman quantum 
gauge and a unitary background gauge; i.e., we expand the background fields about 
the physical vacuum to make the physical states obvious, but leave the quantum 
fields unexpanded to avoid complicating the Feynman rules. This also avoids the 
complication of having to expand about the vacuum twice, since vacuum values get 


quantum corrections to those appearing in the classical action. 


The gauge invariance of the effective action in the background-field formalism is 
a big advantage over other quantum gauges, where the effective action is only BRST 
invariant, since gauge invariance is a much stronger constraint than BRST invariance: 
Gauge symmetry is local, while BRST is only global. Thus, the background-field 
gauge produces a much simpler effective action. In other words, the background- 
field gauge produces an effective action without ghosts: Although we can drop ghost 
terms from the effective action in general, because there are no physical external 
ghost states (since we calculate only the “tree” graphs of the effective action), the 
result is not normally BRST invariant; but in the background-field gauge it is still 
BRST invariant, since it is gauge invariant. This means that the background-field 
gauge yields not only simpler results, but fewer calculations: Many terms can be 


determined by “gauge covariantization” . 


Exercise VIB8.3 
Consider an effective action for Yang-Mills plus matter in a background-field 
gauge. Its gauge invariance can be used to derive “Ward-Takahashi identi- 
ties”. (These were originally expressed as properties of the S-matrix, but are 


much simpler to understand in terms of the effective action.) 


a Show that the part of the effective action quadratic in the Yang-Mills fields, 
and independent of the matter fields, is invariant under the Abelian gauge 
transformations. (Hint: Taylor expand.) Thus, in such gauges the quantum 


correction to the gluon propagator is transverse. 


b By the same method, find a relation between any quantum 3-point vertex 
coupling matter to Yang-Mills and the corresponding matter propagator cor- 


rection. (Note a simpler case: Since the renormalization counterterms are 
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local, gauge invariance just says that the coefficients of the two corresponding 


counterterms are the same, i.e., they occur in the combination ~ WYW.) 


However, this does not mean we can completely ignore BRST and ghosts by using 
background-field gauges: Although the effective action is gauge invariant and ghost 
free, ghosts and BRST still appear in the (quantum-gauge-fixed) classical action. In 
practice this means, as far as calculating the Feynman graphs that contribute to the 
effective action, that in the background-field gauge calculations are about one loop 
simpler than in other gauges. For example, for one-loop graphs we effectively cal- 
culate free one-loop vacuum bubbles (including ghosts) covariantly coupled to back- 
ground fields: There are fewer of the complications of nonabelian theories, since the 
quantum fields appear only as non-gauge fields with covariant couplings and no self- 
interactions. However, already at two loops we have self-interactions of the quantum 
fields, which include the same kinds of terms that would have appeared had we not 


used a background-field formalism. 


Another complication is that BRST invariance is not as restrictive as gauge invari- 
ance: It can be shown that in general gauges at the quantum level BRST invariance 
is preserved only up to “wave-function renormalizations” (rescalings) of the quan- 
tum fields. However, in the background-field gauge wave-function renormalizations 
of the quantum fields can be ignored, since the quantum field is a dummy vari- 
able: There are no external quantum fields, so all such factors cancel. (Actually, we 
can also ignore wave function renormalization counterterms in non-background-field 
gauges, since when calculating S-matrix elements such divergences will be canceled 
by corresponding divergences in the external-line factors. In general, external-line 
normalization factors may be nontrivial even when wave-function renormalization is 


performed, depending on the renormalization scheme.) 


An exception is Abelian gauge theories, such as QED: Because the gauge-invariant 
action for just the gauge fields is free, background field gauges are identical to ordinary 


gauges. Also, the ghosts decouple (for linear gauge conditions). 


The main point of the background-field gauge is that two gauge choices can be 
made. This method can be further generalized so that there are three independent 
gauge choices: (1) First we choose the quantum gauge as before, to obtain an effective 
action that is gauge invariant with respect to background gauge transformations. In 
terms of S-matrix diagrams, this is a choice of gauge for propagators inside loops. 
(2) Then we choose the background gauge as before, to obtain S-matrix elements. 
This is a choice of gauge for propagators external to 1PI subgraphs. (3) Finally, there 


is still a gauge invariance of the external fields: These fields describe asymptotic 
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states, and hence have a linearized gauge invariance. This means that the generating 
functional 7(A), or W(A), can always be written in a form invariant under Abelian 
gauge transformations. (In fact, this will be the case in general, without a background- 
field gauge, since the S-matrix is gauge independent.) As a consequence, Z and W 
can always be rewritten in terms of Abelian field strengths, making this invariance 
manifest. (However, the Feynman rules generally will not give them directly in this 
form.) Writing them in this form has the same advantages as manifesting background 
gauge invariance in the effective action: There are fewer possible terms one can write, 
Lorentz covariance is manifest, comparison is easier, and more gauge choices are 
available. (In practice, we usually choose some unitary gauge for the external fields, 
to isolate the physical polarizations.) Furthermore, since asymptotic states are on 
shell, these external Abelian field strengths satisfy the free, Abelian, gauge-covariant 
(Maxwell) field equations, giving further restrictions on the number of independent 


ways they can appear (with respect to derivatives acting on them). 


9. Nielsen-Kallosh 


So far we have considered only gauges where the gauge-fixing term is the square 
of the gauge-fixing function. More generally, we’ll need gauge-fixing terms of the form 


fOf for some operator O. Straightforwardly, we can write 


iQ}CUI(A) — JoO™'B] = 3B(f — fo0~'B) + HOLE. [v, C| 


However, B is no longer auxiliary, so we can’t eliminate it by its field equation. But 


we can diagonalize the Lagrangian by the corresponding redefinition, 
BoB+ 40 a 


(The Jacobian of such redefinitions is unity, the determinant of a triangular matrix 


of the form é aa .) The gauge-fixing terms are then 


—iaBO'B+2fOf 


The inverse operator is inconvenient for Feynman rules. We know that integrating 
out B gives a functional determinant, so O~! can be replaced by an O if we change 
the statistics of the Nakanishi-Lautrup field. However, this is a bit formal, since tech- 
nically O must be symmetric between the two B’s, while it should be antisymmetric 


between two fermions. 


Exercise VIB9.1 
Instead of f(A) — 4a07!B, use O[ f(A) — 4aB], and again diagonalize. The 
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extra factor in the ghost kinetic term can then be put in a separate term 
by (the inverse of) the method of exercise VIA4.2. This method avoids any 
symmetry problems with O. 


A useful example is gauge fixing for super Yang-Mills in superspace. Gauge fixing 
for massless Yang-Mills is actually more difficult than for the massive (Higgs) case, 
considered in subsection VIB5. We’ll look at the Abelian theory, to determine what 
kind of gauge fixing we need to define the propagator. (With slight generalization, 
this is also sufficient for the background-field gauge: See the following subsection.) 


In that case the BRST transformations are 


QV=i(C-C), QC=-iB, QC=-iB, QC=QB=QC=QB=0 
where C, C, and B are chiral. Using the result (the Abelian case of exercise [VC4.1) 
W, = —id’d,.V 


the gauge-invariant kinetic term is (rearranging derivatives and using integration by 


parts; see subsection VIB5) 
i= - [9 sW°W,, = - f ate sVd*d?dV = [eo V(40 - d?d?)\V 
To gauge-fix to the Fermi-Feynman gauge we choose 
I =-iQ f ate (C+ Ov + CGD) "B 
= f te (Ec -C)- (8+ BW - BGO) "B 


(dropping d*@ integrals of totally chiral or totally antichiral terms, which vanish). If 


we were to simply redefine B by 
B>+B-@d@V, BoB-ddv 
the gauge-fixing terms would diagonalize as (using d?d?d? = $M?) 
—~(B+ BV —- B(g0) 'B+Vd?d’V — B(Z0)'B 
giving the desired result for V: At this stage the total result is 
b=Io+li— fae QVOV+6c-6C- BGO) "5) 
Because B is complex, the replacement of B with a fermionic superfield can 


be performed classically, just like the rest of the gauge-fixing procedure. We thus 


introduce ghosts for a trivial gauge invariance as described in subsection VIA4: 


QD=E, QE=-iD, QE=QD=0 
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We have treated the ghosts and their hermitian conjugates independently; alterna- 
tively, we can consider D and FE as not being the conjugates of D and E. The gauge 
fixing is simply 

Ly = -iQ(-ED) = DD -iEE 


We next make the redefinition 


D=+=D+(40)'@B, D—-D+(s0)'dB 
which has the effect 
[es [DD — B(40)"'B| > [eo DD+ [es DB+ [es DB 
which vanishes, after using the now-algebraic field equations from varying B and 


B. Alternatively, we can make this field redefinition instead of the previous field 


redefinition: We then have the terms 
[es [DD —-(B+B)V]+ [es DB+ [es DB= [eo VaaPV 
after using the still-algebraic B equations. 
The net result 
L=ILo+ 11+ Ly — f ats (AVOV + Cc — CC -iEE) 


is that the original nonlocal B term has been replaced classically with the local 
EE term, which yields the same determinant upon quantization, but gives simple 
Feynman rules more directly. (The determinant is nontrivial in the background-field 


gauge. A similar procedure can be applied to gauge fixing for spin 3/2.) 


Exercise VIB9.2 
Apply the method of exercise VIB9.1 to super Yang-Mills, where O is now 
d or d? (as implied by the form of L, above). Thus, the expression in L; on 
which Q acts will have terms V@C and CB , and their h.c. Show the result, 


instead of 3 fermionic ghost pairs, is 4 fermionic and 1 bosonic ghost pairs. 


Exercise VIB9.3 
Perform the analogous quantization for the nonabelian case of pure super 
Yang-Mills (no matter), using the super Gervais-Neveu gauge. Compare with 
the limit m — 0 of the model considered in subsection VIB5, and show the 
V part of the action agrees. 

Exercise VIB9.4 
Use this method to produce a gauge-fixing term a(n-A)D(n- A) for a gauge 
vector A in terms of a parameter a and constant vector n. Find all propaga- 


tors. Look for simplifying special cases of a and n. 
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10. Super background-field 


Although in principle the background-field formalism is the same for supersym- 
metric theories as nonsupersymmetric, there are some technical differences because of 
the nonlinearity in the prepotentials. (Similar remarks apply to nonlinear o models.) 
The basic idea is that we want to expand the full covariant derivative in quantum 
fields about background-covariant derivatives: As for the nonsupersymmetric case, 
V — D+iA, but now A is further expressed in terms of D and the prepotential 
because of the constraints. The generalization in this case (and for nonlinear 0 mod- 
els) is easy because the solution to the constraints makes the prepotentials appear 
as (complex) group elements: Because of the closure of group multiplication, we can 
write 


& — SBsaq 


in terms of quantum (gg) and background (gg) group elements (fields). More explic- 


itly, for our case we write (see subsection IVC4) 


2 


e? — e@B e%a 


and thus for the covariant derivatives 
-Q Q 
Vawe “Dye? 


absorbing the background prepotential completely into the background covariant 
derivative 
D, =e "8? d,e?8 


In other words, as the name suggests, the full covariant derivative V has been ex- 
panded about an arbitrary background, described by (2g. (This is even clearer in the 
supergravity case, where we simply replace the flat-space dy with the curved-space 
D,., since dy is more than a partial derivative, and already contains the flat-space 
part of the metric tensor.) For purposes of quantization, it is most convenient to go 
to a chiral representation for the quantum field. For the background field we need not 
be so specific, since it is hidden in the background covariant derivatives. The result 
is then 
Vee De’. Va— Da, Vad — i{Dz, ee” Dye” } 


where V is the quantum field. 
Exercise VIB10.1 


Solve the rest of the commutator algebra to find expressions for all the field 


strengths in terms of V and Dy. 
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The rest of the quantization procedure then follows as for the nonsupersymmetric 
case, except for the Nielsen-Kallosh ghost described in the previous subsection. In 
particular, for the terms in the gauge-fixed classical action quadratic in the quantum 
field V, 


W,, + —i4[D*, (Dy, e~-” Dae” }] = Wa — iD? DV + ihD?\V, DLV] + O(V?) 
=> Syv= / dx d*0 V(-4D°D?D, + i3W°Da + D?D’)V 
Pushing the D in the first term to the right, we find 
—1D°D’D, = ii (D“DzDo + DgD™ Da) — D°D® 


Using integration by parts on all the derivatives in the second term so they act to the 


left, then switching the V’s so they again act to the right, 
DODD DOD, DYDD. D, 
=D D.D, kD), De (DD: 
= -i0 + 2W"D, 
where = D*D,. The final result is similar to the bosonic case (exercise VIB8.1): 
Sw = i: de 6 1V(O+ 2W°D, + 21" D,)V 
(This result is invariant under integration by parts because of the Bianchi identity 
DW? +D,W = 0.) 


Ghosts and matter are quantized straightforwardly: For matter we have 


Vad Vad 0 > 


get, dae(G+9), Dep=DaG=0 
The action thus looks the same as usual ((6+¢)e” (y+), etc.), except that all chiral 
superfields are now background-chiral. For the standard ghosts we have for the ghost 
action So = f dx d‘@ Lo (remembering there are no background ghosts, and using 


the full nonlinear transformation law from exercise IVC4.3) 
Le = (6 + C)Lypleoth(Lva)(C — €) + (C+ OQ) = (6 + CVC - 6) + O(V) 
(6C=CC sO) 


the same as in non-background gauges, except again the ghosts are background-chiral. 


Now the Nielsen-Kallosh ghost of the previous subsection is nontrivial: We again have 


Lux = —iEE 
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but these ghosts also are background-chiral. This means they contribute to the effec- 


tive action only at one loop, through “vacuum bubbles”. 
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We have seen how covariant expansions of the S-matrix can be based on various 
definitions of h. Covariant expansions can also be based on spacetime quantum 
numbers: For example, we can perturb in mass; this is equivalent to adding low- 
energy corrections to the high-energy approximation. Also, the first-quantized version 
of the h expansion, which expands in powers of momenta, is effectively an expansion 


in inverse powers of mass (low-energy approximation). 


The only other spacetime property of a particle is spin, or helicity for massless 
particles in D=4. It is possible to define expansions in terms of it by describing the 
leading order by a complex action. This violates semiclassical unitarity at that order; 
however, the loop expansion violates unitarity at tree order also, so the expansion 
is still useful as long as unitarity returns once the expansion has been summed. 
Furthermore, we have already seen that gauges where unitarity is not manifest have 
some advantages over unitary gauges. In particular, the Gervais-Neveu gauge uses a 


complex gauge condition. 


1. Yang-Mills 


We first consider calculations for massless theories; these are simpler than massive 
ones in D=4 because the little group of the Lorentz group is SO(D—2) instead of 
SO(D-—1), and is thus Abelian: We can label the spin of a state by an integer or half- 
integer, the helicity, by use of the spacecone formalism. To simplify notation, we drop 
the transverse index (p' — p), and distinguish 4-momentum P from its transverse 
component p by using upper- and lower-case. We also use color ordering; i.e., we 


examine only planar diagrams for each permutation of external lines. 


We begin by summarizing the spacecone rules for pure Yang-Mills found in sub- 
section VIB6: The Lagrangian appearing in the action S = gtr [dx L, writing 


derivatives as momentum operators for later convenience, is 


L= At(-}P*)A~ + (2A*)[A*, pA] + (2A) [A pA] + [At pA] a[4-, pA*] 
Twistor notation (see subsection IIB6) is used: 
(pq) = —(ap), (pa){rs) + (ar) (ps) + (rp)(qs) = 0, (paq)* = [ap] 


p’ =(p-)[-p], po = (+p)[pt], p= (+p)l-pl, p= (p—)|pt+] 
(+-) =[-+] =1 
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The propagator and vertices are read from L in the usual way, but in addition we 


have further simplification from the choice of external line factors 


[=p] (+p) oe Pp 


E=—, €_ = —; —€g = —ECn = 1 
* +p) [-p] p- : 


where @ and © are the reference lines, with + and — helicity, respectively (not to 
be confused with the earlier notation for spinor indices a = (@,©)). However, the 


reference momenta for helicities + are taken from lines with helicities +: 
fe=\-i-&  fe=))4| 
=> PL=0', PZ = 6% 


The reference external line factors occur only in the above combinations, because only 


1 term of 1 of the 3-point vertices contributes to each. 


The simplest examples are classes of diagrams that vanish by virtue of their 
“maximal helicity violation”: By simple counting of +’s and —’s, we see that the tree 
graphs with the fewest external —’s, those with only self-dual vertices (++—), have a 
single external —. Thus the all + amplitude vanishes automatically. Furthermore, the 
diagrams with a single external — must have that line chosen as one of the reference 
lines. However, by the above rules that line can carry only the antz-self-dual vertex 


(——+), so those amplitudes also vanish. 


The simplest nonvanishing amplitude is ++——. We consider the case where the 
helicities are cyclically ordered as ++——; we label them 1234, and choose 1 and 
4 as the reference lines; this amplitude can be denoted as ®+—©C. (Py = |+)(4+], 
P, = |—)({—|: The positive-helicity reference line gives the reference momentum for 
negative helicity, and vice versa.) We label all external momenta as flowing inward. 
There are only three diagrams; however, the + reference line uses only the ++ — 
vertex, while the — reference line uses only the ——+ vertex, so the 4-point-vertex 
diagram vanishes, as does the diagram with both reference lines at the same vertex. 


Thus, we are left with only 1 graph. 


2+ 3- 


16 4e 


Furthermore, we know that the 3-point vertices contribute only 1 term to the 


reference line, so this graph has only 1 term. This means we can immediately write 
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down the answer (dropping the factors of —g at each vertex): 


fo eee a 7 gp 
€94€3- Pp2p3 1(P; nm P,)? +2) [— (+2)[ 2](+3)[ 3] (34) [34] (+—-)[-4+] 
_ [12]?(34) 
~ [34(41)(14) 


2—-E+| ]2-E-) | and used 


where we have restored helicity and dimensions (trees go as ( ) 
Pi = pa = 0. We have omitted the usual group theory factor (see subsection VC9). 
(Note that the propagator is —1/4P?, because of the signature for the + spacecone 
components. This extra sign cancels that coming from the fact that one vertex has 
cyclic ordering and one anticyclic with respect to group theory, i.e., the commutators 


in the action.) Using the identities, following from overall momentum conservation, 
(Pr + Pa)? = (Po + Ps)’ = [41] (14) = [23] (32) 


do lp)Pl=0 =  (84)[14] = —(82) [12] 
this can be put in the standard form 
[12]* 
[12][23][34][41] 
Exercise VIC1.1 


Using similar manipulations, cast it into the form 


(34)" 
(12) (23) (34) (41) 


Another simple form can be obtained from the original form by doing a little less 


[12)°(34)? tr FC PQ) FB) F(A) 
(34) [34][41](14) (35) (5) 
using f = i|p)(p| and f* = i|p|[p| (from subsection VIB6). Unlike the others, this 


form is directly in terms of physical quantities, namely momentum invariants and 


cancellation: 


(linearized) field strengths (see subsection VIB8). Although similar expressions hold 
in other dimensions, where twistors may not exist, twistors allow for a simpler deriva- 


tion. 


Exercise VIC1.2 
Repeat the calculation for the + — +— (color-ordered) amplitude: 


a Find the form in terms of just ( )’s, or just | |’s. 


b Find the form in terms of momentum invariants and field strengths. 
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The corresponding differential cross section is very simple: Using 


(pq)* =—[pq] => |(pa)|? = I[pall? = -P.-Q 


and momentum conservation, we find ee including the coupling g) 
2 


i 

ITP = oy or oS 
depending on the orientation of the diagram a. respect to time, for this color- 
ordered contribution. (Depending on the color quantum numbers of the external 


states, this can be the only contribution.) Then (see subsection VC7) 


do ad re 
ge = 2l2n)"9 «(4 or a 
A more complicated example is the +++—— amplitude. Again taking color- 


ordered (planar) amplitudes, we choose the amplitude cyclically ordered as +++—— 
with lines labeled 12345, picking 1 and 5 as the reference lines, which we denote as 
@++-—©. Again dropping all graphs with a reference line at a 4-point vertex or 2 
references lines at a 3-point, all 5 graphs with a 4-point vertex are killed, and only 
3 of the remaining 5 survive. (We also need to consider various combinations of + 
and — indices, but only 1 survives for each graph because of the chirality of 3-vertices 


with reference lines.) 


Since 3-point vertices with (without) a reference line have 1 (2) terms, we are left 


with only 6 terms. The initial result for the amplitude is then 


a +1 = +1 ce 
pi ( — 7) PoP) (2 — 2) P3D4 (3 —*) 


“Sete |B PPB) (Pi PPB) CPP 


where we have used the fact that the reference lines have trivial momenta: 1 for the 
component with + index opposite to its helicity, 0 for the remaining components. 


The two terms for each diagram simplify to one, using 
po _ tl Pa es _ 2413] -— BH[-2) [23] 


p (pl Pp. —-~P [—2][-3]  [-2][-3] 
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(applying the cyclic identity) with our normalization. Using this result, we find the 


similar result 


Py tl ps _ (+=)-3] + (+2)/23] (+4) [34] 


Po ps (+2)[-2][-3] (+2) [-2][-3] 
applying momentum conservation. We next translate the momentum denominators 
into twistor notation, and also substitute the spacecone expressions for the polar- 
izations and numerators. Canceling identical factors in numerator and denominator 


(but no further use of identities), the amplitude becomes (+ = 5, — = 1) 


(+4)° ( [4° [41134] 


Gays) esas] ” ya] * G84) 


a (St oan) -- ee (45)* 
(+2)(+3) \ (2—){23) (2) (34) (2— 


)(23)(34) (12) (23) (34) (45) (51) 
applying momentum conservation twice, restoring normalization, and replacing the 


numerals for +. 


Exercise VIC1.3 
Using the spacecone gauge, evaluate all diagrams contributing to the six-point 
gluon (Yang-Mills) scattering tree amplitude (T-matrix) with color-ordered 
helicities +++-+—-—, that correspond to the symmetric diagram with a central 
3-point vertex each of whose legs is connected to another 3-point vertex, each 


of which carries 2 of the external lines. 


These results can be generalized to arbitrary (color-ordered) n-point tree ampli- 
tudes with two — helicities, labeled 7 and j, and the rest + (“Parke-Taylor ampli- 


tudes”): The result is (in an obvious notation), including now the coupling (—g)"~, 


(ij) 


Sten is , a £ sieice i — z oer i m2 NI 
(+1 ig ee itl a j +541 +n) g (12) (23) eo (n = 1, n) (nl) 


Exercise VIC1.4 
Rewrite this result in terms of field strengths and momenta. (Hint: Multiply 
top and bottom by the complex conjugate of the bottom. Unlike the simpler 


n = 4 case, there will be some momenta contracted with field strengths.) 


2. Recursion 


A simple way to derive higher-point amplitudes is using the classical field equa- 
tions. (See subsection VC3. In the literature, the field has often been mistaken for 
the current, since ¢ ~ 6/dJ, J ~ 6/d¢. As usual, these are distinguished by the fact 
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the field always has an external propagator, while the current has it amputated, since 
Ko+...=—J.) The steps are: 


(1) Calculate the first few terms in the series (enumerated by the number of external 
lines). 

(2) Guess the general result. 

(3) Prove that it is correct by induction, using the classical field equations. 


Of course, the second part is the hardest in general (at least when one simplifies the 
third step by using spacecone methods), and has been possible for just a couple of 
cases, only because the results for those cases are so simple. Since these results are for 
off-shell fields, and not S-matrix elements, they are gauge dependent: For example, 
if they are inserted into larger diagrams, the same choice of reference lines must be 


used. 


The solution to the classical field equations is given by tree graphs with all external 
lines but one (the field itself) amputated and put on shell. (The usual external-line 
wave functions describe the asymptotic field, which is free.) The two cases with 
known solutions are those where all the on-shell lines have the same helicity, or one 
different. Note that the field A* has a + associated with the opposite end of its 


external propagator. We then see in the former case, with all +’s on on-shell lines, 


that A~ vanishes because there are no fully-amputated diagrams, even off-shell, with 
only +’s externally (again counting +’s and —’s on vertices). Similarly, for the latter 
case, with only one — on an on-shell line, we see that A~ has only ++— vertices; 
but setting that one on-shell — to be a reference line (which by definition must be 
on-shell), it is not allowed such a vertex, so A~ vanishes also in this case. By similar 
reasoning, we see that A* in the former case consists entirely of ++ — vertices; and 
in the latter case consists of all ++— except for one -—+ (no ++——), which must 


have the — reference line directly attached. 


The appearance of only the self-dual field (At) and almost only the self-dual 
vertex (++ —) means that in both cases one is essentially solving equations in the 
self-dual theory: If we take just the kinetic term and ++— vertex from the action, 
and make the field redefinitions (see exercise VIB6.2) 


At=pd, A =p '¢ 
we obtain (after integration by parts and rearrangement inside the trace) 


: ie ator am = o(4P7¢ + Ip ?, p@) 
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These redefinitions make the ++— vertex local. ¢ appears only as a Lagrange mul- 


tiplier, and its variation gives the self-dual field equation 
pLI¢ + i(0°*9) (ae) = 0 


(which differs from the result of subsection HIC5 by an i from the use of p instead of 
O in the field redefinition, and @ — © from the use of the spacecone instead of the 


lightcone). 


We now consider in more detail the simpler (former) example (the one which 
does not directly give a nontrivial scattering amplitude). As a slight simplification, 
we look at the recursion relation for the field ¢@ as defined in the self-dual theory. The 
recursion relation is now (see subsection VC3), scaling the coupling out of the kinetic 


term, 


o(1,n) = —~Tpey a(1, oli — 1,n)[p- (A, i)p(a a 1, n) =, i)p (a - 1, n)| 


k 
= S- Fane 
m=j 


where we again use color ordering, number the external lines cyclically, and $(j, k) 
denotes the field with on-shell lines with momenta P; through P,. (Thus, on the 
left-hand side of the equation the field has n on-shell lines, while on the right-hand 
side the two fields have i and n—i.) Plugging in the twistor expressions for the vertex 


momenta, we find 


p (1,t)p(i+1,n) — p(1,i)p (+ 1,n) ae 5 (+3) [9k] (+k) 
j=l k=i+1 

If we are clever we can guess the general result from explicit evaluation of the 

lower-order graphs; instead we find in the literature, after the above redefinition, 
1 
gC ee) a ee a) 

where N is the number of background momenta (F;, ..., P;) for ¢(2, 7). For the initial- 
condition case N = 1 this is simply the statement that the external line factor for @ 


is NOW 
Ey 1 


€ — ==. 
? "pp? 

The induction hypothesis is also easy to check: The product of the two @’s from the 

induction hypothesis gives the desired result by itself up to a simple factor: 


(1,4)6(i + 1,n) = —4 ine 
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(The algebra of the color indices works as usual.) We then perform the sum over i 
before that over j and & (the complete sum is over all i,j,k with1@ <j <i<k<n), 
making use of the identity 


(+a)(+b)  (+b)(+c) — (+a)(+c) 


(ab) (be) (ac) G41) GY) 
Gary > UTI HH 


i=j 


Multiplying this by the vertex momentum factor gives a sum over j < k of (jk) [jk] = 


P; - Py, canceling the external propagator, yielding the desired result. 


Exercise VIC2.1 
Work out the analog of the above for the anti-selfdual case, paying careful 


attention to signs. 


This result gives the general perturbative solution to the selfdual field equations 
as an expansion in free fields. By similar methods the more complicated case we 
mentioned can also be solved, yielding the Parke-Taylor amplitudes given above, 
when the one external line is amputated and put on shell. (For this simpler case that 
gives zero, since there is no pole in that line.) We can see the same characteristic 


denominator in both expressions. 


3. Fermions 


We have seen in subsection VIB7 how these methods can be applied to massless 
spinors. Rather than applying the rules directly, in this subsection we examine the 
relation of the results in QCD to those in pure Yang-Mills theory. We also saw in 
subsection VIB7 how supersymmetry could be used to relate different QCD ampli- 
tudes. However, in practice supersymmetry relations give only a few useful relations, 
and only ones that can already be seen directly from the spacecone rules, which give 


more results than can be seen by supersymmetry alone. 


The simplest relations that follow from supersymmetry are the vanishing of tree 
graphs with fewer than two negative helicities, which we saw in subsection VIC1 
follows automatically from the spacecone rules. The remaining useful supersymme- 
try relation for tree graphs is the relation between Parke-Taylor amplitudes for pure 
Yang-Mills and those with one external line each of positive and negative helicity 
replaced with spinors or scalars. The easiest way to see this result is to make use 
of the conventions of the selfdual theory, as in the preceeding subsection. In Parke- 
Taylor amplitudes only one vertex is a non-selfdual vertex, which accounts for the 


simplicity of these amplitudes. (Tree amplitudes with only selfdual vertices vanish.) 
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Furthermore, after transforming to the selfdual conventions, all (nonvanishing) self- 
dual vertices are identical — independent of spin. Finally, the nonselfdual 3-point 
vertex with one negative-helicity gluon chosen as a reference line (the only non-selfdual 
vertex we’ll need for this relation) is independent of the spins of the remaining two 
lines. Consequently, the only difference between the two amplitudes we are relating 
comes from the difference in normalization of external line factors for gluons and 


quarks (and scalars). 


We will not review the superspace formulation of selfdual supersymmetric theories 
here. The main features will be evident from the example of supersymmetric QCD 
that we now examine in more detail. The main result follows from treating the selfdual 
field of the nonsupersymmetric theory as a spacecone (or lightcone) superfield, as in 
subsection VIB7. Dimensional analysis then tells us that the field of helicity h has 


dimension 1 — h. The appropriate redefinitions of the spacecone fields are then 


1 
At mi pAt, yr or pyr, Q = Q, yp = v, AT = a 


for the Yang-Mills fields A*, spinors w*, and scalars ¢. The resulting external line 


factors are then simply 


(+p)-7* 


After these redefinitions, the kinetic terms, selfdual (++ —) vertices, and antiselfdual 


vertices for — gluon reference line (referencing positive helicity) are 


Ly = AS PPA” + pt 5 P yp 


L3,sa = (p At) ([pA*, A] + {pb b}) + (pT) [pat w] 
ae ae ae 
Lye = (2A-) (bat a] + owt w 


for supersymmetric QCD. (In the A® term in the last line we have used integration 
by parts, and dropped a (p*/p)A~ term that vanishes for the reference line: There 
(p* /p”)e_ = 1 now, so (pt /p)e_ = 0 vanishes for that line since p — 0.) 


Exercise VIC3.1 
Apply these redefinitions to the full action for supersymmetric QCD given in 
subsection VIB7: 


a Find the action and external line factors (especially for reference lines). 


b Evaluate the 4-gluon tree amplitude for 2 positive and 2 negative helicities 


with these modified rules. 
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We now see easily that the terms Ly and Lz .q that define the selfdual theory are 
independent of whether boson or fermion is chosen for the positive helicity fields and 
the negative helicity one (only the helicities of the fields must add up to 0 for Lz and 1 
for Ls .q for Lorentz invariance). Thus, supersymmetry is a much stronger restriction 
in a selfdual theory than a nonselfdual one. Finally, the current that couples to the 
reference line (p*/p?)A~ is also the same for bosons and fermions. We therefore have, 


for example, the relation 


for the color-ordered tree amplitudes (where we have labeled helicities +1 by +). 

This follows from choosing line 1 as reference line © (for positive helicity, from a line 

with negative helicity). For example, from our result for the 4-gluon tree, we have 

the 2-quark, 2-gluon tree 

G-btb = 
(23) (34) (41) 

Exercise VIC3.2 


Repeat these calculations using scalars in place of the spinors. 


In the maximally supersymmetric case (N=4 supersymmetric Yang-Mills), there 
is a very simple form for the combined result of Maximally Helicity Violating am- 
plitudes, n-point amplitudes whose external helicities sum to n—4 (or the opposite; 
amplitudes of the selfdual theory would have helicities summing to n—2, except they 
vanish). They can be derived by the methods described above. In the supertwistor 
space of subsection I1C5, with coordinates 

(pl =p", [pl = (p*,7') 
(with p’ = a™ in terms of the notation there) we can write the amplitude as 
gr? 6 laa) 
(12) (23) ---(n — 1,n){n1) 
where we have included explicitly the usual momentum conservation 6-function (which 
is nontrivial in twistor space) as part of its supertwistor generalization. Note that 
the actual supertwistor space used is more of a (anti)chiral supertwistor space, as is 
appropriate for describing selfdual theories (in analogy to that described in subsection 
IVC7 for ADHM twistors). For example, the (- —+---+) (Parke-Taylor) amplitude 
of subsection VIC1 is obtained since the helicity +1 appears at zeroth order in p' in 


the external twistor superfields multiplying this amplitude, and helicity —1 at highest. 
Exercise VIC3.3 
1 


Extract from this amplitude the result for (—, —5, +3, +) given above. 
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4. Masses 


The spacecone formalism yields the simplest method for deriving S-matrix ele- 
ments in massless theories (at least for trees; for loops it may be preferable to use 
background field gauges, with a Lorenz gauge, like Gervais-Neveu, for the quantum 
gauge and spacecone for the background gauge). The analogous method for the mas- 
sive case is to use actions based on self-dual fields, as described in subsection IIC4. 
The advantage of these two methods is that they use fields that are representations of 
the little group, so in the massive case fields have 2s+1 components and only undotted 
spinor indices (SO(3)=SU(2)), while in the massless case they have only 2 components 
and no indices (SO(2)=U(1)). Although the actions used are more complicated, this 
is just a reflection of the fact that algebra that is usually done repetitively in graphs 


has been performed once and for all in the action. 


However, in the massive case the simplification is not as drastic as in the massless 
one: S-matrix elements are just simpler in massless theories, with many vanishing; 
the spacecone method takes advantage of this simplification in the final results by 
simplifying the intermediate steps. The massive examples we will consider in this 
subsection, taken from QED, are somewhat simple in any case, so we will stick to 
the older methods (although the uses of methods based on self-duality are still being 
explored). 


The major difference in simplicity between the massless and massive cases (in any 
approach) is in the external line factors. The ambiguity in the explicit expressions 
for the external line factors is just the little group: In the massless case the solutions 
to the field equations (one solution and its complex conjugate) are unique up to a 
phase factor, which is why the twistor formalism is so useful. In the massive case the 
solutions (2s+1) are ambiguous up to an SU(2) transformation, which means they are 
messy for any choice. Just as in the massless case the twistor is part of the Lorentz 
transformation from an arbitrary frame to the lightcone frame, in the massive case 
the solutions are part of the transformation to the rest frame. In other words, the 
external line factors simply convert Lorentz indices to little-group indices; this makes 


indices trivial for the massless case (in D=4), and not as nice for the massive. 


The result is that in practice whenever any of the external particles are massive 
their external line factors are left as implicit in S-matrix elements, and only their 
squares are explicitly evaluated, in cross sections. This was common in older experi- 
ments (especially QED), since recent experiments are mostly at energies so high that 


masses of external, stable particles are usually neglected. This adds to the algebra, 
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since it means that Lorentz algebra is performed in each of n? terms in the cross 


section rather than n terms in the S-matrix. 


Furthermore, the algebra is usually simplified by considering experiments where 
polarization is determined in neither the preparation of the initial states nor the 
measurement of the final states. This was also common in older experiments, when 
devices for polarization were not well developed. The result is that one averages over 
initial states and sums over final states, producing the same algebraic factors that 
appear in the propagator, as described in subsection VB3: A is replaced with A,. 
One then applies the rules for Feynman diagrams for cross sections, as described in 
subsection VC7. 


The standard S-matrices in QED are the 4-point tree graphs, with 2 3-point 
vertices and 1 internal propagator. There are just 2 graphs to consider, with various 
labelings of momenta: (1) The graph with 4 external fermions (electrons/positrons) 
connected by 1 internal photon describes both Meller (electron-electron) and Bhabha 
(electron-positron) scattering, 2 labelings each. (2) The graph with 2 external photons 
and 2 external fermions, as a continuous line that includes the 1 internal fermion, 
describes Compton (electron-photon) scattering as well as electron-positron creation /- 
annihilation, also 2 labelings each. In each case, the 1 S-matrix diagram results in 2 
cross section diagrams, each with 2 momentum labelings (for a total of 2x2=4): 1 


diagram from multiplying similar terms and 1 from cross-terms. 


In Dirac spinor notation the Lagrangian for QED is (see subsection IIIA4) 
gF? + W(iP —eA+ 2)y 


where we have scaled the “e” into the vertex. The Feynman rules are now (Fermi- 


Feynman gauge): 


Photon propagator: — Mav/ 5p” 
Fermion propagator: (p+ ) /4(p? +m?) 
Vertex: CVa 


where we have used pf? = —3p". (We use the Fermi-Feynman-gauge propagator also 


for defining the cut propagator; ghosts decouple in QED.) 


The cross section diagrams contain closed fermion loops, resulting in traces of 
products of y matrices (with a —1 for each loop by Fermi-Dirac statistics). The 
algebra is manageable for the present case, using the 4D 7-matrix identities from 
subsection ITA6: 


a 


VW Va=—-2, dya=h, Vdbya=a-b, y°bbeya = He 
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ir) =4, tr(d¥) = —2a-b, tr(db¢d) =a-bc-d+a-db-c—a-cb-d 


The traces encountered in the above processes are of the form 
Ny = tr(y*Ay’B)tr(yaC' wD) 


Nog = tr(7*AyaBy’CyD), N3 = —tr(y*Ay’ByaC 1D) 


where A = @+ v5, etc. Using the above identities, these are evaluated as 
N, = 4m* + 2m?(a-b+c-d)+2(a-cb-d+a-db-c) 


No = 4m* + m?[2(a+c)-(b+d)—a-c—4b-d]+(a-bc-d+a-db-c—a-cb-d) 


N3 = 2m*+m?(a-b+a-ct+ta-d+b-c+b-d+c-d)+2a-cb-d 


Exercise VIC4.1 
Generalize the above identities and expressions for the N’s to arbitrary di- 


mension D. 


Our first example is ete~ — ete~ (“Bhabha scattering”). We have aligned 
all momenta to be that of the electrons (i.e., minus that of the positrons), so that 
all numerator factors are p + ro} without signs. Specifically, we have chosen p; for 
the (positive-energy) momentum for the incoming electron, —p2 for the incoming 
positron, p3 for the outgoing electron, and —p, for the outgoing positron. With these 


conventions, 
—8 = (pr- po)” = (p3 — pa)’, —t = (pi — ps)” = (p2 — pa) 
—u = (p1 + pa)” = (p2 + ps)? (pj = —m?, s+t+u= 4m?) 
=> Pi: P2 = P3'Pa = 5s—m’, Pi P3 = P2°Pa = st—m’, Pi Pa = P2°Pp3 = —jutm? 


For the squared amplitude we have for the average over initial polarizations and 
sum over final 
N,(1342 N,(1243 N3(1243) + N3(1342 
Ly“ IT? = 1 1 u( he 3(1243) + N3(1342) 


ia s? st 
po 
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Every other N term is the result of switching s — t (pg © p3, or p; < ps) in the 
previous, since that is the relation of the 2 Feynman graphs contributing to the S- 
matrix. The N, terms are the squared diagrams, while the N3’s are the cross terms. 
The “—” in N3 comes from Fermi-Dirac statistics, switching two fermion lines. (This 


% 


as for 1 fermion loop vs. 2.) 


l k 
i ; bd 


N (ijkl) N,(ijkl) 


is the same relative ” — 


Finally, from subsection VC7 we have the factors to get the differential cross 


section, 


do 
dt 


= 3(2n)*|T PA, No = =[s — (mi + mg)”| [s — (my — mz)?| 


so in this case 


do) __(2nye*_ f(s) + Fw), K+ SW) 
(Fans Renan | 


— 7) ats 
dt s — 4m?) e ? 


st 


a 


The probabilities |T|? for e~e~ — e~e~ (“Moller scattering”), or ete* — e*et, are 
related by crossing symmetry s © u (p, << —p3 or py < —p4): 


(F) et a ! (s) + f(u) | f(s) sl (t) 


i u tu 
A convenient frame for any of these cross sections is the center-of-mass frame 

(subsection [A4). In these cases all the external masses are equal, so the Mandelstam 

variables have simple expressions in terms of the energy (which is the same for all 4 


particles) and the scattering angle: 


s =4E’, t = —4(E? — m?)sin? £, u = —4(E? — m”)cos? 2 
2 2 
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2 @ 2 4 
S Se u *, 
I 1 3 


Another famous example is ey — ey (“Compton scattering”). Now we label 
p, for the incoming electron, p3 for the incoming photon, p2 for the outgoing electron, 


and —p, for the outgoing photon, so the Mandelstam variables are 
= 2. 2 _ oe 2 
—s = (pi + ps)” = (p2 — pa)’, —t = (pi — po)” = (ps + pa) 


—u = (p1 + pa)” = (p2 — ps)” (p 
ed Pi Po = xt —m’, Pi- p3 = —3(s — m’), Pi pa = —3(u—m’) 


p2* ps = x(u—m*), Pa + Da = 9(8 —m”), D3 Pa = —5t 


i 


| 
N2(ijki) ~ N3(ijkl) 


The probability is 


No(1,1+3,2,1+3)  No(1,14+4,2,1+4) 
ths 


= (s — m?)? (u — m?)? 
N3(1,1+4,2,1+3) + N3(1,1+3,2,1+ 4) 
(s — m?)(u—m?) 
mt +m?(3s+u)—su  »m*+m?(3u+ s) — su m?(t — 4m?) 
~2 (ene 2 (a = my (5 — m)(u — m?) 


where now every other term comes from switching s < u (p3 <> p4), and the cross 


section is, after some rearrangement, 


d In)3e4 1 i 1 1 
ea = (net 4m* | ——— ine oft 4m? aie 
dt } compton (8 — ™*)? s—m u-—m? s—-m u-—m? 
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u-m g3oe m 
s—-m u-—m 
A useful frame is the lab frame (i.e., the rest frame of the electron), where in 


terms of the initial and final (positive) energies (EF and E’) and scattering angle of 


the photon we have 
s=m+I%mE, u=m?-ImE', t= mE — E); 


By crossing symmetry, s < t, we get ete™ — 2y (“pair annihilation”) and 


27 — ete” (“pair creation” ): 


do (27)3e4 or 4 Le sy ak 1 
— a 4 — An? | —— + ——_— 
(Tan s(s — 4m?) f—m | um? meu (ae GaP 


do (27)3e4 ; 1 1 2 : 1 : 
ae = —— |.4 et ee ota 4 Fb ge 
= = “ Se ae ae ae 


Exercise VIC4.2 
Calculate all the corresponding massless cross sections using the spacecone 
gauge. Show they agree with the m = 0 case of the above. 

Exercise VIC4.3 
Calculate all the above massive cross sections using scalars in place of the 


fermions. 


Exercise VIC4.4 


Calculate all the above massive cross sections replacing the photons with 
massless 
a scalars 


b pseudoscalars (with a y_; at the vertex). 
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5. Supergraphs 


In supersymmetric theories the easiest way to calculate Feynman diagrams is in 
superspace. Supersymmetric cancellations are then automatic, and new special prop- 
erties of supersymmetric theories are revealed. The derivation of the “supergraph” 
rules is similar to that of subsection VC1, except for some fine points in the treat- 
ment of chiral superfields. The path integral required the explicit evaluation of only 
a Gaussian for perturbation. Since we dropped proportionality constants, this was 
equivalent to substituting the solution to the classical, free field equations back into a 
quadratic action. For real scalar superfields (used for super Yang-Mills) this is trivial, 
but chiral scalar superfields (used for scalar multplets) satisfy the chirality constraint, 
and have superpotential terms: integrals over chiral superspace (f dx d?0), not the 
full superspace (f dx d*@). 


We want to make use of the identity for evaluating the path integral (see subsec- 
tion VC1) 


du 
V2 


Then the “action” we need to integrate is 


g= - f ae d'0 bo — [/ da 20 (2)46? + he. = (| dx 0 os + he.) 


consisting of the (derivative part of the) kinetic term, mass term, and (minus the) 


—uMu du —uMu wW f-1 
e7uM fut) = = oe uMu/2, & f(v) ~ eIveM Ov/2 F (1) 


term that acts on e~*!!¥l, Solving the field equations (see subsection IVC2) 


” h) mc 
72 m _ 72 m _ 


a Pea) é= ! pe =a 3 
(tm) 6p VP0p)? — *  a(-O +m?) Vi bp V2 0G 


The propagator exponent f $(6/dy)(1/K)(5/d¢) thus becomes (putting back y — ¢) 


6 1 6 ) 1 6 
dz d*6 — | —____ } — ded’ 0 =| Se) ne ie 
feraos( yoaram) i+ | O88 (Aycosm) ie | 
Before writing the Feynman rules, we first note that functional differentiation 


with respect to a chiral superfield, as follows from the above variation, gives 


t) 


ioe, ) = d°)°(0 — 0')6(x — 2’) 
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This means that there will be an extra d? at the ¢ end of any chiral propagator and 
an extra d? at the ¢ end. We could associate these directly with the propagator, but 
we will use one factor of d? to convert a J #6 into [ d*0 at any superpotential vertex, 
and similarly for the complex conjugate. Therefore, we include such factors explicitly 
as a separate Feynman rule for the ends of chiral propagators. However, this means 
the d¢ propagator (and similarly for ¢@) gets an extra factor of d?/ sO to compensate 
for the fact that we include two d? factors, whereas it really had only one because 
its integral was only d?0. Furthermore, we Fourier transform x as usual, but not 6, 
basically because there is no translation invariance in #, but also for a better reason 


to be explained soon. The Feynman rules of subsection VC4 are then modified as: 


(A235) Theta’s: one for each vertex, with an [ d‘0. 
(A3’) Propagators: 


(A43) Chiral vertex factors: d? on the ¢ end(s) of every chiral propagator, 


d? on the ¢ end(s), but drop any one such factor at superpotential vertex. 


We next explain how @ integrations are performed on any connected graph. Con- 
sider any two vertices directly connected by a propagator. All the spinor derivatives 
acting on its 64(6 — 6’) can be removed from that propagator by integration by parts. 
We then can use that 6 function to trivially integrate over 0’, removing the [ d*0’ and 
that 64(@ — 6’), and replacing 6’ everywhere with 6. Effectively, those two vertices 
have been contracted to the same point in @ space, eliminating that propagator as 
far as 8 dependence is concerned. We can repeat this procedure until all vertices are 
contracted to a single point. However, we are then left with a “tadpole” for each 
loop: Contracting propagators this way sequentially around a loop identifies all the 
vertices of that loop, and leaves the loop as a single propagator with both ends at 


that point. To evaluate this tadpole, we note that 


[Pa254*(0 — O’)\lor—p = 1 
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(0’ derivatives can be converted into minus 6 derivatives when acting directly on 
the 6; this is basically integration by parts.) Fewer derivatives give 0; more can be 
reduced to terms of 4 or less. This completes all the evaluation in @ space, leaving an 
expression in terms of fields (some with d’s acting on them) with different momenta, 
times the usual momentum factors, with the usual momentum integrals, but all at 
the same 9, with a single [ d*0. This means that the generating functionals W and 


I are completely local in 6. 


There is a further consequence of this evaluation. We have obtained terms with 
J d*6, but none with [d?6. However, to do it we had to introduce the factors 
d?/(—$p”) into the ¢¢ propagators. On the other hand, such a factor can easily 
be killed by a d? from a vertex: We sandwich the d? between a d? from each vertex, 
using the identity d?d?d? = —4pd?, and return the d? to one vertex. The only time 
we can’t do that everywhere is if every vertex is a superpotential (so every propagator 
in the graph is ¢@ and every external field is ¢), since otherwise we can inductively 
borrow d?’s from some non- { @6 vertex. Any such 1PI graph always vanishes, be- 
cause there are exactly enough d’s left to make the tadpoles nonvanishing, leaving an 
J @*6 of a product of ¢’s with no d’s, which vanishes. On the other hand, for a tree 
graph there is exactly one d?/p” left, which converts the { d*@ to an f d?6. 


The net result is that not only are W and I’ local in 6, but only their classical 
parts can have [ d?@ terms, and the spurious d?/p* factors (which should not appear 
in massive theories) are always canceled. In particular, this implies that all UV 
divergences are { d‘@ terms: All terms in the superpotential are unrenormalized (no 


loop corrections) to all orders in perturbation theory. 


Exercise VIC5.1 
Calculate all the contributions to W[@, ¢] from 4-point trees in massive super- 
@° theory, and write the result in both p- and z-space (in analogy to the 


nonsupersymmetric example at the end of subsection VC4). 


Improvements again result from using background-field gauges. We have already 
seen in subsection VIB10 the modification to the quantization for supersymmetric 
background-field gauges. The background-field expansion can be defined by solving 
the constraints on the full covariant derivatives in terms of quantum prepotentials 
but background potentials (A,4, not V), essentially by covariantizing d4 to the back- 
ground Vy. Then Vj, can be manipulated (integration by parts, etc.) in the graphs 
in the same way as d, was, leaving only A, (not A,) and its derivatives (W,, etc.) as 


background fields. This leads to improved power counting, and can be used to prove 
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“superrenormalizability” (finiteness beyond one loop) for N=2 extended supersym- 


metric theories, and finiteness for N=4. 


As for other gauges (background-)chiral superfields need special treatment, now 
to get the most out of background gauge invariance. Variation can be defined in the 


obvious way, but now we also need the covariantized identity 
D°D*¢ = (0+ i[W*, Dal) 


from pushing the D’s to the right and using the commutation relations. The func- 
tional integral over the quantum background-chiral superfields can also be performed 
in the same way as for other gauges, the only modifications being background co- 
variantization (including the above “nonminimal”’ term for O), and the fact that 
we can no longer neglect the “vacuum” contribution (one-loop diagrams with only 
background fields externally). Specifically, if we look at the general derivation of 
the Feynman rules in subsection VC1, we see it gave rules for all graphs except the 
one-loop vacuum bubble, since this graph has no (quantum) vertices. These rules, as 
adapted to superspace earlier in this subsection, are now modified only by the covari- 
antization just discussed, which only adds background potentials (not prepotentials) 
and field strengths to propagators and vertices. The background-covariantized prop- 
agators then can be further expanded about the free 0. The net result is that in 
all diagrams except (perhaps) these chiral one-loop vacuum bubbles the background 
fields appear only in the form of potentials and field strengths. These vacuum bubbles 
then can be evaluated by the usual methods, since the formerly neglected Gaussian 
path integral for these “quantum-free” fields is just the usual one-loop path integral 
for a chiral superfield with external Yang-Mills superfields, only the external fields are 
now identified as background instead of quantum. In some cases, this last calculation 
can be further simplified to again yield an expression directly in terms of potentials 


without explicit prepotentials (see subsection VIIIA6 below). 
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VII. LOOPS 


Although our analysis so far is sufficient to evaluate the lowest-order term in the 


h expansion (“trees”), certain new features arise at higher orders. 


pladanintasiatenactencedess A. GENERAL ......0.0200 ee: 


When infinities were first found in perturbative quantum field theory, they were 
thought to be a serious problem. A prescription can be given to remove these in- 
finities, called “regularization”. It was later shown that this regularization can be 
defined in such a way as to preserve all the desirable physical properties of the theory, 
called “renormalization”. Unfortunately, it seems that the original infinities, exiled 
by renormalization from any finite order of perturbation theory, return to plague field 
theory when all orders of perturbation theory are summed. Therefore, renormaliza- 
tion should not be considered a cure to the disease of infinities, but only a treatment 


that allows divergent theories to be more useful. 


1. Dimensional renormalization 


“Perturbative renormalization” is defined to preserve the three properties that 
define relativistic quantum field theory (Poincaré invariance, unitarity, causality). 
The most general prescription is to start with a classical theory that is causal, Poincaré 
invariant, and satisfies the semiclassical part of unitarity (as described in subsection 
VC5). This gives the tree graphs of the theory. One then applies unitarity to define 
a perturbation expansion, determining the higher orders (loop diagrams) from the 
lowest (trees). Although the usual loop diagrams are divergent, there is enough 


ambiguity in the unitarity condition to allow removal of the divergences. 


The only practical way to implement this procedure is to slightly modify the di- 
vergent graphs one obtains from the naive use of the Feynman graph rules (obtained, 
e.g., from path integral quantization of a classical action). The steps are: (1) “Reg- 
ularize” each divergent graph by modifying the momentum integrals, introducing a 
parameter(s), the “regulator(s)”, giving a finite result that reproduces the original 
divergent integral in a certain limit. (2) “Renormalize” each regularized graph by 
subtracting out the “divergent part” of the regularized graph, keeping only the “fi- 
nite part”. Once the graph has been rendered finite, the regulator can be dropped. 


One then has to check that the method of removing divergences, order-by-order 


in the perturbation expansion, preserves the three properties of relativistic quantum 
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field theory. The easiest way to do this is to use both a regularization scheme and a 
subtraction scheme that preserve these properties manifestly. The standard way for 
the subtraction scheme to do this is to change the coefficients of terms in the classical 
action by (real) constants that depend on both the regulator and h. Since the classical 
action already satisfies Poincaré invariance, causality, and the semiclassical part of 
unitarity, this will automatically preserve all the desired properties. In this manner 
of renormalization, there thus remains only two steps to prove that a theory can be 
renormalized: (1) the existence of a regularization that manifestly preserves the three 
properties, and (2) that the modification of the action by making the coefficients 
regulator- and h-dependent is sufficient to cancel all divergences that might reappear 


in the limit as the regulator is removed. 


The latter step, discussed in subsection VIIA5 below, can be further divided into 
substeps, proving: (a) The ultraviolet divergence in any graph corresponding to scal- 
ing all internal (integration) momenta to infinity (the “superficial” divergence) comes 
from a term in that graph polynomial in external momenta, and can therefore be can- 
celed by a local term from the action; and (b) recursively in the number L of loops, 
if the renormalization procedure has already been successfully applied through L—1 
loops, the resulting modification of the action is sufficient to cancel all divergences 
appearing at L loops except the superficial ones. The former substep is the one that 


detemines whether the theory can be renormalized. 


The former step is satisfied by dimensional regularization, the standard method of 
regularization in relativistic quantum field theory (and for practical purposes beyond 
one loop, the only one). It is defined by writing the theory under consideration in 
arbitrary dimensions D, and treating integrals over loop momenta as analytic func- 
tions of D. These integrals are then analytically continued from lower D, where they 
are (ultraviolet) convergent. The resulting expressions have pole singularities in D at 


integer D, so these poles can be subtracted out as the divergent parts. 


There are two main reasons why dimensional regularization is so useful: (1) Most 
classical actions can be written as easily in arbitrary dimensions as in D=4. (The 
important exception is those that in some way involve the Levi-Civita tensor €gp...c. 
The difficulty with such theories is not a drawback of dimensional regularization, but 
a general property of quantum field theory, and is related to the quantum breakdown 
of classical symmetries, to be discussed later.) In particular, this means it manifestly 
preserves gauge invariance (which is a part of unitarity), the property of relativistic 
quantum field theory most difficult to preserve. It is also the only workable scheme 


of regularization to do so. 
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(2) It requires only one regulator, the number of dimensions D itself. (Most other 
regularization schemes require at least one regulator for each loop.) This is the main 
reason why this scheme is the only practical method of regularization at higher loops. 
(An enormous number of regularization schemes have been proposed, almost all of 
which work well at one loop, but all of them are more difficult than dimensional 


regularization already at two loops, and even worse at three loops and higher.) 


The renormalization scheme based on this regularization is also very simple: 
Defining D = Dp —2e (where Dp is the physical dimension, usually 4), we can Laurent 
expand any L-loop amplitude in e¢, starting at 1/e’. These 1/e” terms arise from n 
or more divergent D-momentum integrals. If we cancel all the negative powers of e€, 
we can take the limit D — Dp (€ — 0) by just dropping all the positive powers of e, 
i.e., evaluating the remainder at ¢ = 0. The procedure is to modify the coefficients 
of the terms in the “classical” action (couplings, masses, and field normalizations) by 
making them e and fi-dependent, giving them i*~'/e” (“counter”)terms. Such terms 
can cancel any local divergence at L loops. One then has to show that they also can- 
cel all nonlocal divergences at higher loops resulting from this divergence appearing 
in an L-loop subgraph. Thus, the procedure is recursive: (1) apply the counterterms 
obtained from calculations at less than L loops to cancel subdivergences; (2) cancel 
the remaining, local, superficial divergences by introducing new L-loop counterterms. 
The form of the superficial divergence can be determined by evaluating the divergence 
coming from the region of momentum space where all loop momenta go to infinity 
at the same rate. The superficial divergence is determined by 1/e terms of this loop 
and of subloop divergences; however, if the 1/e piece of a prospective counterterm 
vanishes at a certain loop order, so do all higher powers at that loop order. Thus, sim- 
ple power counting (as well as global and local symmetries) is sufficient to determine 


what counterterms can appear at any particular loop order. 


These rules are sufficient for evaluating momentum integrals to the point where 
renormalization can be applied. However, further simplifications are possible where 
spin is involved: Techniques specific to D=4 are useful to simplify algebra in general, 
and required to preserve manifest supersymmetry in particular. These methods treat 
spin indices as 4D, in contrast to the vector indices on momenta (and coordinates), 
which are analytically continued away from D=4 by the definition of dimensional 
regularization. This is natural in 4D N=1 supersymmetric theories formulated in 
superspace (or 4D N>1 in N=1 superspace), since there spins < 1 are described by 
scalar prepotentials: There the simple prescription is to continue in the dimension 


of the commuting coordinates (a), while fixing the dimension of the anticommuting 
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coordinates (). These rules, for either supersymmetric or nonsupersymmetric the- 
ories, have a simple physical interpretation for integer D<4: dimensional reduction. 
The reduced theories differ from those produced by simple dimensional regularization: 
Vectors get (4—D) extra scalars; spinors may become multiple spinors. For super- 
symmetric theories this is again natural, since vector and scalar multiplets remain 
irreducible after dimensional reduction. Unfortunately, vectors in nonsupersymmet- 
ric theories reduce to vectors plus scalars that are not related by any symmetry, so 
their renormalization is independent. However, the complications produced by the 
extra renormalizations are usually smaller than the algebraic simplifications resulting 
from the restriction to 4D spin algebra, especially for lower loops. Another compli- 
cation is that Levi-Civita (€) tensors can’t be treated consistently in the dimensional 
reduction scheme. Although serious in principle, in practice this is not a problem as 
long as axial anomalies cancel, which is required anyway for unitarity. (See subsection 
VIIIB3. Also, axial anomalies are easier to calculate using Pauli-Villars regularization 


than with any form of dimensional regularization. ) 


Exercise VITA1.1 
Consider the identity 
OnPcIaT eS f) = 0 


which holds in D < 5, where 6 is a D-dimensional Kronecker 6, as appears 
from momentum integrals (since momenta themselves are D-dimensional by 
definition), and p,q,r,s are momenta. Show by index contraction that an 
inconsistency arises when trying to analytically continue away from D = 4. 
This difficulty in defining totally antisymmetrized D-dimensional objects is 


why Levi-Civita tensors don’t exist in dimensional regularization. 


2. Momentum integration 


The first step in performing momentum integration is to make all integrals Gaus- 
sian by exponentiating propagators using Schwinger parameters 
1 7 2 2 
CO dt eT +m )/2 
3(p? + m?) 0 
The general momentum integral in an arbitrary Feynman diagram is then 


. d'Pk P —k?.A(r)k/2—k?-B(7,p)+ik? -2—C(7,p,m) 
A = N(p, —10z) (Qn)ED dT e : as - 


where A, B,C are first-order in T; B is first-order in p while C is second-order in p,m; 


and L is the number of loops and P the number of propagators. Also, we have used 
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matrix notation with respect to the L-dimensional loop-space, with respect to which A 
is a matrix, B is a vector, and C isa scalar. Finally, N represents all “numerator” fac- 
tors (propagator numerators and vertices; everything but propagator denominators) 
and has been brought outside the integral by Fourier transformation (i.e., introducing 
x to produce a generating functional for all numerators). The momentum integrals 


are now Gaussian, and can be evaluated by the methods of subsection IB3 (and VA2): 


A= N(p, —i0;) per (det APP ee Ae eee 


xz=0 


Some of the 7 integrations can be performed by various scalings of subsets of the 
T’s. For example, to see the superficial divergence of the graph we scale all of the 7’s 


and insert the identity as 


i= [ aa(a-Yn), T; = ro, 
0 i 


where a are “Feynman parameters”. The amplitude then becomes 
A = N(p, —idz) / d \P-I-LD/2 GPa § (1 ay a) [det A(a)|-?/? 


x oe Aer Bae Oe a ae ee ae) 


(This method of introducing Feynman parameters is equivalent to directly changing 


variables from r’s to A and one less a, and finding the Jacobian.) 


The x derivatives in N must now be taken. For a contribution from these deriva- 


tives of order A~", we have A integrals of the form 
fe )P-1-LD/2-n .d[B7 A~*B/2-C] = r(P = 4LD _ n) [C _ ip Ate eee 
where we have used the definition of the I’ function 
Fie) -{ dd te 
0 


This integral converges only for Re z > 0, but we can extend it to (almost) all z by 


analytic continuation: Using integration by parts, 
[oe] d [oe] 
ei(Z\= | dy eos = (e*d*) | + | di Ve * =I (e+1) 
0 0 


in the convergent region. Analytic continuation then says to evaluate the integral for 
T(z) in the region 0 > Re z > —1 as I'(z+1)/z in terms of the integral for ['(z +1), 


and so on: . 
lee) n 1 lee) 
z-1,-A _ ztn.—-A 
/ ax oe te - (a) f OO anak: 


k=0 
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Thus, I’(z) has simple poles in z at all the nonpositive integers. 


Exercise VITA2.1 
Using the identity (z+ 1) = zI(z), derive the following special cases for 
nonnegative integer n: 

(2n)! 

ie 


P(n+i)=nl,  P(nth)=(n-})n- 9). AVE = 


Feynman parameter integrations give more complicated functions. A simple but 


common example is the Beta function 


P(a)Py) 


Bz,y) = / da a® *"(1—a)*1 = Tia) 


The latter expression for the Beta function can itself be derived by similar methods: 


Exercise VIIA2.2 


Derive the following B function identities: 


a Use the integral representation of the I’ function to write an expression for 
I(a)I'(b) as an integral over two Schwinger parameters, and introduce a scal- 
ing parameter (as with general two-propagator Feynman graphs, except here 
there is no momentum). Show the result is (a+ b)B(a, b), where B is given 
by the integral definition, thus proving the Beta function can be expressed in 


terms of Gamma functions. 


b Use the integral definition of B to prove 


Bie, 2)= 2!" B(s, 2) => a = ane 


Use this result to find the expression in exercise VITA2.1 for (n+ 5). 


c Derive the identity 
[ doe ary = Bia 
0 
from the substitution tT = z/(1 — z). Use this to show 
I(z)fQ — z) =m csc(1z) 


by changing variables 7 = e“, and closing the contour in the complex plane 


to pick up the contributions from the poles. We thus have 


P(z+n)P(1-—z-n) = (-1)"P (2) -2z) 
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(Hint: To drop the 
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= F(z). 


(which is also seen inductively from I'(z + 1) 
contour at oo in various directions, it might help to work in a particular region 


of the complex z (not u) plane, and analytically continue.) 
In general UV divergences will come from powers of 1/e in a I’ resulting from 
integration over some scaling parameter. We thus need an expression for the Laurent 


expansion of I’(z). This can be obtained directly from the integral expression: Using 


ie) = li i = 2) =- ii 7B +1)=21i . 
(z) Beet : ( *) ed (2,70 zm n II _ 
from the change of variables \ = na, and the above identities. Defining the “Euler- 


3s 


the definition of e as a limit 


Mascheroni constant” y by 
y= lim (- n+ >) :) = 0.5772156649... 
=> e Y= lim n’ I] en 
m=1 
we then can write = ; 
e* nm 
T(z) = 16? 
(z) ze II 1 + = 


which is an alternate definition of [. We then have 


In D(L—2) = 72+ 5 [-inQl- )-4] = 
n=1 
(oe) [oe] 1 
C(y) = ~ mi 


InD(A-—z)=yz+ es 


=2 
by Taylor expansion of the In, where ¢ is the “Riemann zeta function 


Exercise VITA2.3 
Derive the following I’ identities from the previous 


a Find the first two terms in the Laurent expansion of I'(z) 
lim I°(z) = lim, 2P(z+1)=2-74+ O(2), i -{ dX (In A)e™ 
2 = 0 


b Do the same for expansions about other integers 


1 
i+e(-4 02) LO 
m1 


I(n+1+e6 =n! 
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P(-n+e) = (152 


|i +e(- +> x) +0 


c Using the csc relation in exercise VITA2.2c and the above expansion of 


In P'(1 — z), show that 
(: ag 2 = Zz 


and thus ¢(2n) can be written as a rational number times 7?” 


cot 2 = 


RilR 


3. Modified subtractions 


The convenient normalization for the quadratic part of the gauge-invariant action 


for an arbitrary field theory we use is 


(gp) P-9/2 [oe dex 
( 


Ss, = 
: ge 27) 


Be 3eKS 
for a real field @ and some coupling constant g, where for bosons 
Kk =3(-O+m’) +... 


The explicit factor of 1/2 cancels the factor of 2 obtained when varying the action with 
respect to ¢. (Similar permutation factors are used for interaction terms.) Equiva- 
lently, it gives the natural normalization for Gaussian functional integration over ¢ 
in the field theoretic path integral. For complex fields we instead have ¢K@ without 
the 1/2, since then ¢ and ¢ can be varied (or integrated over) independently. 


The “renormalization mass scale” ~ has been introduced to preserve the mass 
dimension of g in arbitrary spacetime dimension; it appears naturally with the nor- 
malization Sp because the kinetic operator contains sp" and $m’. Generally there 
will be more than one coupling, but only one y. For some purposes it may be con- 
venient to scale the fields so that the coupling and 4: dependence appear only in the 
interaction terms. We will usually suppress the dependence, since it is determined 
by dimensional analysis, and is relevant only for quantum corrections, where D 4 4 


becomes important. 


Note that our normalization differs from that normally chosen in the literature. It 
has been chosen to give the normalization appropriate for Gaussian integrals, which 
appear in both the first- and second-quantized theories. The first difference is the 
factor of (27)~?/? for coordinate and momentum integration (rather than the usual 1 


for coordinate and (27)~? for momentum); the second is the factor of 1/2 multiplying 
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the kinetic operator p? + m? (contained in K), rather than 1. Our normalization is 
more natural not only for Gaussian integration and Fourier transformation, but also 
slightly simplifies perturbative field theory calculations, allowing one to ignore spuri- 
ous factors of 2 and especially 27. The net effect is only to change the normalization 


of coupling constants, since such factors can be absorbed into the 1/g? sitting in front. 


For example, the most accurately experimentally verified prediction of quantum 
theory is the anomalous magnetic moment of the electron, to be discussed later. The 
result for the total magnetic moment, in various normalizations, to second order in 
perturbation theory, is 


2 2 


e Cy 
Lmag 6 = a) 


6679 


where “e” is for our normalization (obviously the simplest), “e,,” is the normalization 
you first learned in classical mechanics (the one that gives e?,/r? as the electrostratic 


” is the one you 


force between two electrons, assuming you used cgs units), and “er, 
would see in other quantum electrodynamics courses (or in classical mechanics in 
mks/SI units, if you ignore the €)). To complicate matters, you may have also seen 
the definition a = e?, = ey /4n, of which the only merit is supposed to be that 1/a 
is very close to the integer 137, which is silly since 1/e? is even closer to the integer 
861. (Actually, a is the natural expansion parameter for nonrelativistic quantum 
mechanics, which is basically 3D, but our e? is more natural for the loop expansion, 
which is inherently 4D.) We have also used units i = 1; restoring it introduces the 
further complication a = e7,/h = e},h/4m because of the difference in the semiclassical 


expansions for quantum mechanics and quantum field theory. 


Furthermore, for nonabelian groups we have an extra factor of $ compared to the 
standard normalization because we normalize trp(GG;) = 6;; instead of trp(GiG;) = 
30ij: The latter originated from the case of SU(2), where it cancels the V2 in the 
diagonal generator (and in the others, if one uses hermitian ones rather than raising 
and lowering). Unfortunately, for SU(N) with N > 3 this historical normalization 
introduces a \/2, while not canceling factors like YN (and making trp(G,;G;) N- 
dependent would wreak havoc when considering subgroups, as well as for raising and 


lowering operators). Thus, in the above notation, 


2 
2. oi 


belian : = 
nonabelian g 6x2 


Although we have chosen a normalization for the coupling constants that is nat- 


ural for Gaussian momentum integration, and for symmetry with respect to Fourier 
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transformation, in divergent integrals it has the disadvantage of having y’s (Euler- 
Mascheroni constant) in the finite parts. Another natural normalization that gets 
rid of this irrational number from all graphs is to divide out the angular part of the 
integrals: The volume of the unit D—1-dimensional sphere (the surface of unit radius 


in D-dimensional Euclidean space), is easily evaluated with Gaussians: 


ioe | cae ot = (any? fers) | * dk kP-te BP 
(21) P/2 : 


= srg)? [arto 
We might therefore choose our normalization to cancel this factor in momentum 
integrals, along with the (27)~? from Fourier transformation. Then the action, e.g., 
for a massless scalar, might be normalized as (with conventional kinetic term) 


af Pg FE ERM age HP f__ 2 __ rags 
y= Ho f aa S00)? = © lar” 


This differs from our previous normalization by a factor of 1/I°(2), which is 1 in 
exactly D=4, but differs infinitesimally far away. The result is that the two schemes 
will differ effectively by finite renormalizations: For example, in divergent one-loop 
graphs the 1/e divergences will have the same coefficient in the two schemes, but 
the finite remainders will differ by constants, since effectively the coupling has been 
redefined by a factor of 1+O(e). Thus, the same result can be achieved by modifiying 
the counterterm to be proportional to 1/e + constant. Hence, the earlier version of 
dimensional regularization is called “minimal subtraction (MS)”, while the modifi- 
cation inspired by the volume of the sphere is called “modified minimal subtraction 
(MS)”. 

We now examine explicitly the difference between the two schemes. As we can 
see from our previous example, momentum integrals from scaling various subsets of 
Schwinger parameters in a multiloop diagram will produce I’ function factors of the 


form, with this new normalization, 
[[e@Qu“ ra - 2:4) 
for some integers n;, where }),L; = L. In even dimensions Dp (especially 4), we 


can use I'(z + 1) = zI(z) to write each L,-loop factor as a rational function of 
€ = (Do — D)/2 times 


[rd — 6/4 r(1 + Lye) ~ (e%) He - "= 1 
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where we have written only the y dependence. Thus all y’s cancel. This modifica- 
tion allows some further simplification by eliminating extra terms arising at 2 loops 
involving ¢(2) (but this is only 77/6; see exercise VIIA2.3). Similar results can be 
obtained by using I°(1 — €) instead of I'(2). 

Another subtraction scheme, the “G scheme”, is defined by normalizing momen- 
tum integration so that the coefficient of the 1-loop massless propagator correction 
in ¢® theory in 4 dimensions (see subsection VIIB4 below) is exactly 1/¢€ (without 
extra finite terms) times a power of 4p’, or that up to a sign in higher even dimen- 
sions. (The normalization factor must be positive, and also finite and nonvanishing 
as € + 0.) As for MS, we can also pull out rational factors to get just the (1+ ne)’s. 
The net effect of these two schemes, as compared to MS, is to modify h, which appears 


as { dx/h in the classical action or h [ dp for loop integrals, as 


MS: hoI(2)h or P(1—eh 
. (<1)? PA) 
Ws ra >) p2-12-1° Ta+era—ep" 


(If we want to be picky, we can also normalize the former forms appropriately for 
D = Do, by including an extra factor of 1/'(22) for MS and (42 — 1)/I'(Do — 2) 
for G.) 


Exercise VITA3.1 
Explictly evaluate the difference between the MS, MS, and G schemes to order 
€, including the picky Do factors. 


This particular fix for eliminating irrational numbers works only for those arising 
at one or two loops: In general, because subdivergences produce expressions of the 
form I'(1 + €)/e* at L loops, we encounter finite terms involving ¢(L) at L loops, 
which is irrational (worse than just 7’s, e’s, V/2’s, etc.) for odd L. So, for example, 
¢(3) appears at 3 loops, and it can be shown that ¢(3) (and higher ¢(n)) can’t always 
be canceled. 


” rather than simplifying numer- 


The “momentum subtraction scheme (MOM) 
ically, is designed to give a more physical interpretation of the coupling constants 
appearing in the action: It is defined so that they take their on-shell values. Thus, 
it is particularly suited to low-energy calculations, which involve an expansion about 
the mass shell. For example, consider the quantum kinetic operator K + AK, ap- 
pearing in the quadratic part of the effective action. It depends on the momentum 
through one variable: p? or #, etc. We then can consider Taylor expanding it in this 


variable about its classical on-shell value. (For reasons to be explained later, this 
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can be dangerous for massless fields, when it requires infrared regularization.) This 
is equivalent to expanding in powers of the classical kinetic operator K itself. The 
MOM prescription is then to use subtraction terms —dK to cancel the terms in the 


quantum correction AK to the kinetic operator that are linear in K: 


AK=a+bK+O(K*) => 6K=-a-—bK 
1 1 
=> Krn=K+AK+6K=K+O(K’*) = Z =e OU) 


The two renormalizations are related directly to the “wave function” and mass renor- 


malizations, one being proportional to the entire kinetic term, the other to the con- 
stant (mass) term. The result is that the renormalized propagator has the same pole 


and residue as the classical one. 


Note that the MOM scheme, unlike the others, does not introduce an independent 
mass scale yw: Only physical masses set the scale. This is a consequence of the fact 
that the MOM scheme is designed for studying low-energy (near-mass-shell) behavior, 
while the others are more suited for studying high-energy behavior. This will be 
important for our explicit calculations later, when we see that MOM is more useful for 
QED, which is better defined (and thus more useful), in terms of perturbation theory, 
at low energies, while QCD is better defined at high energies. More precisely, the on- 
shell values of QED masses and couplings are observed experimentally, whereas those 
of QCD are almost meaningless, since the corresponding particles are not observed 
as asymptotic states. On the other hand, in QCD the introduction of the arbitrary 
scale yz allows the definition of a more physical mass scale, and its arbitrariness can 


actually improve the accuracy of perturbative calculations. 


4. Optical theorem 
Writing the S-matrix as S = 1+ 77, unitarity can be written as 
1=SiIS=(1-i)1477) =14+i0T-T)+7TT 3s TT =i(T'-T) 


T since then T represents 


(Actually, the more useful statement is in terms of S = e° 
the connected graphs, but the result of the argument is the same.) Summation of a 


probability over final states then yields the “optical theorem” : 
> Fe? = SOaITUNAGIT |) = GIT'T |i) = 2 Im Ti 
f f 


Applying unitarity in terms of the cutting rules (subsection VC6), we see that this 


condition can be applied to 7;; diagram by diagram, using any combination of parts 
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of Ty; and T' >; that fit together to form the graph considered for J;;. (For example, 
the probability coming from a tree graph with two final states is the imaginary part 
of a one-loop graph with two intermediate states.) Separating out the momentum- 


conservation 6-function for a connected S-matrix element, we have finally 
T=5(Sop)T > Y8(Sp) (tah =2 Im 7 
f 


The simplest example of an experimental measurement of an interaction is a 
decay rate. (The only particle properties contained in the free Lagrangian are mass 
and spin.) For example, at the tree (classical) level, decay into 2 particles is described 
by just the 3-pt. vertex, while for decay into 3 particles, it can be described by a 4- 
pt. vertex or a tree graph with 2 3-pt. vertices. More generally, by the optical theorem 
the decay probability is given by the imaginary part of the propagator correction, 


evaluated on shell. 


We then find the total decay probability per unit time by dividing the probabil- 
ity by the spatial density p times the spatial volume times the time duration, and 
summing over final states (see exercise VC7.1): 

dt ; pVp Ww 
using the expressions for p, and P in terms of |T’p;|*, given in subsection VC7. The 


optical theorem can be applied similarly for the total cross section: 


(2n)P/? 


12 


The decay rate for a particle is frame dependent, but we usually pick the rest 
frame for massive particles, where w = m. Alternatively, we can define the total 
decay probability per unit proper time: 
dt dP 2ImT;; 
ds ds om 
(where s and t should not be confused with the Mandelstam variables). For the 
massless case, we can use instead the parameter T, as it appears in classical mechanics 
in the gauge v = 1, or as the classical value of the Schwinger parameter from the 
Landau equations: 

ac” dP 


7=-— > —=2ImT;, 
P dt dt a 


Exercise VITA4.1 
Compare this result for dP/ds with that of exercise VC7.la to obtain an 
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explicit expression of Im T,; in terms of |T';|? for the decay of a particle of 
mass M into two particles of masses m,,m2. What happens to Im T;; for 


M =m,+me, and for M < m,+ m2? 


Now the decay rate of a particle can also be associated with the imaginary part 
of the mass, since 


M=m-—ir = [ab|? ~ eae 


so the wave function for the particle at rest automatically includes a decay factor. 
The probability of decay, normalized by dividing by |w|?, is thus 
dP d(1—|p|?) _ 


— = Tt Lor 


dt [pdt 


The analogous statement in momentum space is found by Fourier transforming the 


propagator/wave function from time to energy: 


1 1 1 1 
—iMt = ee 
FSM oan Bam ee 
This expansion is in terms of the free propagator 1/(#—m) and the connected graphs 


—0r. 


We now check that this result agrees with that obtained from the effective action. 
The quantum propagator has a pole at p? = —M? for some complex constant M: 
1 
li A= ———— 
Po M2 5(p? + M?) 
We have normalized the propagator at the pole by rescaling the field; we also keep 
the real part of the mass the same as the classical value through renormalization of 
the mass term. (Only the real part can be renormalized consistently with unitarity.) 
The on-shell condition is then at physical (real) momentum p? = —m?, so the kinetic 
operator on-shell is 


3(p? + M?) = —3m? + 3(m — ir)? = —$r? —imr 


Remembering that “interaction” terms from I’ contribute with a minus sign to am- 


plitudes, we then have 
dP 2ImTy — 2mr 


dt m m 


5. Power counting 


We now consider why subtracting out divergences, as poles in D, can be imple- 


mented by giving singular D-dependence to the coupling constants. This is based on 
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dimensional analysis, which tells us how divergent a graph is at large momenta: the 
“ultraviolet” (UV) divergence. (There are also infrared divergences, which occur for 
physical reasons, and do not require renormalization. They occur only for massless 
particles, and will be considered later.) Consider first any 1-loop 1PI graph. In mo- 
mentum space, it has an integral over the loop momentum, [ d?p. It will diverge 
if the integrand (before introducing Schwinger parameters) goes as p~? or slower to 
infinite momentum (UV limit). In this limit we can ignore masses. If we differentiate 
this graph with respect to any of the external momenta, it will become more conver- 
gent, since the power of momenta in the integrand decreases. (The numerator of the 
integrand is a polynomial, while each factor in the denominator depends on the loop 
momentum.) With enough derivatives, it becomes convergent. This means that the 
divergent part of the graph is a polynomial in the external momenta. Similar remarks 
apply to any 1PI graph, if we consider the divergence coming from letting all loop mo- 
menta go to infinity, known as the “superficial divergence”. Of course, the superficial 
divergence is also polynomial in the coupling constants, as is the graph as a whole; 
but the superficial divergence is also polynomial in the masses, since differentiation 


with respect to them has the same effect as with respect to external momenta. 


We can determine several more properties of this local, but divergent, contribution 
to the effective action. First of all, it is Poincaré invariant and invariant under all 
global symmetries of the classical action, since the effective action is invariant for all 
values of D, so poles in D are also. (Consider, e.g., contour integration in D to pick 
out the pole.) If we use the background field gauge (or consider only Abelian gauge 
fields), then it is also gauge invariant. (However, possible exceptions are conformal 
invariance and invariances involving y_;, since those are not invariances of the classical 
action for all D.) The other property we need is that the coefficient of the divergence 
is real. This follows from the fact that the S-matrix satisfies unitarity, as preserved 
manifestly by dimensional regularization. As discussed in the previous subsection, we 


have unitarity 
SS=1, S=I+iT = i-DN=TT 


As we saw in subsection VC6, this identity actually can be applied to a single graph, 
where the element of J on the left-hand side of the equation is that graph, while on 
the right-hand side the summation over intermediate states gives a sum where each 
term divides the graph into two parts, one for J and one for J'. We then see that 
at any loop the imaginary part of a 1PI graph in 7 is given by “sewing” together 
diagrams from lower loops. This means that any new divergence at any number of 


loops must be real, since sewing doesn’t introduce new (UV) divergences: Sewed lines 
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are on shell, and phase space for on-shell states is always finite. (The 3-momentum 
of each state is bounded by the energy, and each positive energy of an outgoing state 


is bounded by the total energy of the system.) 


Since Poincaré and gauge invariances, locality, and semiclassical unitarity were 
used as properties to determine the classical action, this suggests that the divergent 
terms in the effective action might all be of the form of terms already in the classical 
action. Such a property is called (perturbative) “renormalizability”. When it holds, 
all infinities can be absorbed by a redefinition of the coupling constants (and masses) 
appearing in the classical action. This is physically important because all infinities 
are defined only up to finite pieces: For example, in dimensional regularization, we 
saw in subsection VITA3 that the D-dependent normalization of the classical action 
is ambiguous, resulting in an ambiguity in the finite pieces left over after subtrac- 
tion of 1/e terms. Since we now know that superficial divergences are local, we see 
that renormalization can produce arbitrary finite, local terms in the effective action, 
corresponding to the divergent terms. But if the divergent terms are all of the same 
form as in the classical action, all such finite terms can be absorbed by a redefintion 
of the coupling constants. On the other hand, if such finite terms did not already 
appear in the classical action, we would be forced to introduce them, to make the 
renormalization procedure unambiguous. (Of course, we could give an unambiguous 
prescription by definition, but from the point of view of another prescription this 
would be the same as including the extra terms in the classical action, and using the 
first prescription to arbitrarily fix the nonzero values of the couplings.) Thus, the 
condition of renormalizability is necessary to prevent the appearance of an infinite 
number of coupling constants, which would result in the loss of predictability. (If 
divergences require only a finite number of such couplings to be added, we simply 
include those, to obtain a renormalizable theory with a number of couplings that is 
finite, although larger than that with which we started.) 


Since dimensional analysis determines the form of the divergent terms of any 
momentum integral, it also determines which theories are renormalizable. By appro- 
priate rescaling of fields by constants, write the classical action in a form where the 
derivative parts of the kinetic terms have no dependence on any couplings. Then de- 
fine the couplings to be the coefficients of the interaction terms. It is easy to see that 
the renormalizable theories are the ones that include all terms which satisfy all the 
properties required of the classical action (including preservation of all appropriate 
symmetries that are manifestly preserved by dimensional regularization), where all 
couplings have engineering dimensions that are nonnegative powers of mass: Consider 


first the case where all couplings are dimensionless (and there are no masses, or at 
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least we ignore them at high energies for purposes of considering UV divergences). 
Then the theory is renormalizable simply because there are no dimensionful parame- 
ters around, so any local term must be of the form of those originally in the classical 
action. If we now introduce couplings with positive mass dimension, then perturba- 
tively they can occur only to nonnegative power in any diagram, so any divergence 
thus produced again has a coefficient with nonnegative mass dimension. Since the 
fields themselves have positive mass dimension, there are only a finite number of such 
terms possible. On the other hand, if we were to allow couplings with negative di- 
mension, then terms with arbitrarily high powers of such couplings would also allow 
arbitrarily high powers of fields, and thus lead to nonrenormalizability. (By similar 
arguments, theories with only couplings of positive mass dimension, called “super- 


renormalizable”, can have divergences only to a certain finite number of loops.) 


In particular, in D=4 the derivative part of the kinetic term for bosons is of the 
form [ d*x ¢00¢, and for fermions [ d*xz WO), so bosonic fields have dimension 1 and 
fermions 3/2. That means that bosons can appear only quartically and fermions only 
quadratically. More specifically, renormalizable theories can only have terms of the 


form 
b, 0°, &°, b*, GOO, GG, d304; Y?, VAY; oY? 


(where each @ can be any boson with any spin, and each w any fermion). There can 
also be constant terms (field-independent), which we always drop, since they don’t 
contribute to perturbative amplitudes (after appropriate normalization). The terms, 
and their relations, are restricted by Lorentz, gauge, and internal symmetries. The 
potential for scalar fields must also be bounded from below, to allow the existence of 
a vacuum (state with lowest energy); otherwise nothing would be stable, continually 
decaying into states of lower energy (i.e, the energy of the scalars converting to other 


particles): Thus, ¢? terms for scalars requires also ¢* terms. 


Spin 1 can’t couple minimally to spins >1. (One way to show this is to covari- 
antize the general field equation of IIB1 to S,°V, + kV, and show the commutator 
algebra of this constraint, and 11+ ..., doesn’t close unless the spin <1 or the ex- 
ternal field strength vanishes.) Furthermore, gauge invariance for spins >1 prevents 
them from having renormalizable gauge couplings in D=4: For example, we saw in 
subsection IIIA4 that spin-2 (gravity) couplings include terms of the form ¢0¢0¢. 
Renormalizability therefore restricts us to spins 0, 1/2, and 1. Using Poincaré and 


gauge invariance, the most general action is then of the form 


L = tr { beF? + VIVasba + (V0)? + V(6) + Y%(O+ va + hic} 
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where all group matrices are implicit: They may appear in all fields, in m, and even 
in g, which has independent values for the different factors of the Yang-Mills gauge 
group. Also, the matrices may differ for the same field in different terms: A (in V) 
has different Yang-Mills representations on different scalar and spinor fields, and ¢ 
appears with some matrix in its Yukawa coupling w¢éw. (Of course, all matrices must 
be chosen consistently with gauge and global invariances.) The potential V(@) is no 
higher than quartic. Note that the Higgs mechanism is required to give nonabelian 
gauge fields mass: A? is not gauge invariant, and (V¢)? is the only way to dress it up 
with scalars in a renormalizable way. (For the Abelian case we can use Stiickelberg 
fields, with V = 0+ mAT as in subsection IVA5. In the nonabelian case, introducing 
scalars by a gauge transformation, as for Stiickelberg, results in a nonrenormalizable 
(e-*®Ve'*)? term.) If we ignore gauge invariance, the A? term produces unitary-gauge 
propagators with bad high-energy behavior (see subsection VIB3), which leads to the 


same nonrenormalizable behavior in the absence of a Higgs mechanism. 


Exercise VITA5.1 
Show by power counting that interacting renormalizable theories with poten- 
tials that are bounded from below exist only in D<4. Show that for D<2 
there are an infinite number of possible renormalizable terms in the action. 


What are the kinds of renormalizable terms possible in D=3? 


Exercise VITA5.2 
Superrenormalizable theories aren’t realistic, but they give oversimplified ex- 


amples of many quantum features of field theory. 
a What theories are superrenormalizable in D=3? Show that the only super- 
renormalizable interaction in D=4 is (scalar) ¢°. 


b Let’s do power counting (dimensional analysis) for 4D ¢° theory. Write the 


action in the form 
S= 2 f de (-4o(0- majo +68 


so g? counts loops. (¢ — g@ gives the form where g counts vertices.) What 


are the dimensionless terms 
AS = yl fac ¢” 


for all n (including the vacuum bubbles n = 0; of course, L > 0)? Since super- 
ficial divergences are polynomial in everything (fields, momenta, couplings, 
masses), this gives the maximum number of loops L(n) for a superficial di- 
vergence to appear in an n-point 1PI amplitude. Make a similar analysis for 
3D ¢* theory. 
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c Find all the divergent 1PI diagrams in 4D ¢? theory. (Hint: There are 5, 
excluding the meaningless 1-loop graph with no vertices.) Which of these 
are local, and thus can be completely renormalized away? What kind of 


counterterms are required for the remaining graphs? 


Such global and local symmetry requirements can also be applied to the effective 
action. In background-field gauges I’ is gauge invariant (see subsection VIB8), which 
restricts the form of the effective potential, and even nonlocal terms. In QED charge 
conjugation, in addition to switching 2-spinors of opposite charge, changes the sign 
of the electromagnetic potential: Consequently, any pure-A term in I’ must be even 
in A’s (“Furry’s theorem”). Such classical symmetries can be applied at the quantum 
level only in the absence of “anomalies”, quantum violations (discussed in chapter 
VIII below). However, even the anomalies themselves are restricted by symmetries: 
Anomalies occur only in symmetries that can’t be manifestly preserved by regular- 
ization, which means only conformal or axial symmetries. Thus, when the couplings 
of gauge vectors are parity invariant, the axial anomaly (which violates parity by 


definition) is irrelevant. 


6. Infrared divergences 


Although ultraviolet (UV) divergences represent a serious problem, in the sense 
that they strongly restrict which theories can be useful, and require renormalization, 
infrared (IR) divergences are merely a consequence of poor semantics: The definition 
of the S-matrix assumes the existence of well-defined one-particle asymptotic states. 
Unfortunately, these do not exist when massless particles are present, even in classical 
mechanics: (1) Any particle can be accompanied by an arbitrary number of massless 
particles with vanishing 4-momentum, and such a collection of particles can be indis- 
tinguishable from the lone particle if the (measured) quantum numbers are the same. 
(These are physical states, since p’ = 0 — p? = 0.) (2) Any massless particle can 
be indistinguishable from an arbitrary number of massless particles, with the same 
total 4-momentum, and each with the same sign energy, if their 4-momenta are all 
proportional, since then they are all traveling in the same direction at the same speed. 
(This situation is not important for QED, since the photon can’t decay directly into 


two photons.) 


Experimentally, because detectors have finite accuracy, in the first case there 
can be such “soft” particles with total energy below some small upper limit, and in 
the second case there can be such “colinear” particles within some small angle of 


resolution. In principle this means we should change our definition of asymptotic 
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states accordingly; in practice this is too complicated, but to any particular order 
in perturbation theory only a finite number of such additional massless particles will 


couple. The procedure is then to 


(1) infrared regularize the S-matrix amplitudes (by dimensional regularization, or 


introducing masses for all particles, or keeping massless particles off-shell); 


(2) calculate probabilities/cross sections, including contributions from soft and colin- 
ear particles, as a function of some upper limit on their energy /angle (representing 


the experimental accuracy); and 
(3) remove the regularization. 


No infrared renormalization is necessary. (Examples will be given in later sections. 
Of course, for total cross sections, all energies and angles are integrated over any- 
way.) In general, such a procedure must be applied to both initial and final states 
(the“Kinoshita-Lee-Nauenberg theorem”), but in QED (as opposed to QCD) it is 


sufficient to treat only the final ones in cases of physical interest. 


The reason why infinities appear in cross sections if we ignore this careful pre- 
scription, and in S-matrix elements in any case, is the long range of forces mediated 
by massless particles. A cross section, although it represents a probability, is nor- 
malized in such a way that it is an area, representing the effective cross-sectional 
area of a particle being targeted by another particle. The range of the interaction 
sets the scale of this area; this is related to the mass of the particle mediating the 
interaction. Since massless particles produce infinite-range forces, the result is infi- 
nite cross sections. One might expect that these infinities would appear only in total 
cross sections, where the momenta of particles in the final state are integrated over. 
However, by the optical theorem, this total cross section is given by the imaginary 


part of an S-matrix amplitude, which thus must also have this infinity. 


The fact that these infrared divergences are physical also follows from the fact 
that these kinematic situations occur in classical mechanics: In subsection VC8 we 
saw that physical singularities occur in S-matrices for momenta that are allowed 


classically. 


Exercise VITA6.1 
Consider 2—+2 scalar scattering at the tree level (as in the example of subsec- 
tion VC4): 
a Evaluate the total cross section with all particles massless, and show it has 


an infrared divergence. Relate this divergence to a classical situation. 
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b Do the same with external masses MW and internal mass m, where there is 
no divergence. Find first the explicit general result, then look at the limit 
m <s—4M?. 
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We now give some explicit examples of the evaluation of S-matrices and contri- 
butions to the effective action — momentum integration, regularization, and renor- 


malization — and some examples of their application. 


The simplest examples of dimensional regularization are one-loop “tadpoles”, 


1. Tadpoles 


graphs with only one external line. By the Schwinger parameter method described 
in subsection VITA2, we find 


Ai(z, m?) _ fa ikea 1 5 a a ae 7 D/2—-(rm? +a? /7)/2 
0 


a(k? + m? 
Further evaluation requires Taylor expansion in x (which we’ll need anyway to eval- 
uate a specific integral of k...k/(k? + m?)): 
ass ri-2-n) 
_ 1/_1,,2)n 2 

Al = » ni(— 32") (4m2)1-DP-n 
The mass dependence, as well as the argument of the I’ function, are as expected by 
dimensional analysis: [ d?k k?"/k? is ultraviolet divergent (large k) for D > 2(1—n), 
and infrared divergent (small k) in the limit m —0 for D < 2(1—n). The ultraviolet 


divergence is reflected in I'(z), which has poles at the nonpositive integers. 


To analyze the massless case, we evaluate the 7 integral for D < 2(1—n) and 
m > 0, where it is finite and well-defined, analytically continue to the region Re D > 
2(1—n) (but not exactly at the points where D is an even integer), take the limit m > 
0 there, and finally analytically continue this vanishing result to all D. Therefore, all 


massless tadpoles can be taken to vanish in dimensional regularization: 


or more generally 


B. EXAMPLES 463 


This includes negative a, particularly integrals of polynomials of momenta. Such 
an integral can result from “measure factors”, as discussed in subsections VA2 and 
VC1: For example, if an auxiliary field appears in the action with its quadratic term 
multiplied by a function of other fields, then functionally integrating it out of the 
action results in a functional determinant (in addition to replacing it in the classical 
action by the solution to its field equation). This is represented in terms of Feynman 
graphs as one-loop diagrams whose propagators are all those of the auxiliary field, 


namely 1. The result is then regularized as 


[avi~ so) fakr=o 


consistent with the fact that such factors would cancel corresponding factors we should 
include in the functional integration measure. (In other words, since we can always 


arrange to have all 6(0) factors cancel, we ignore them.) 


On the other hand, massive tadpoles contribute both divergent and finite pieces 


under minimal subtraction: For example, for D = 4 — 2e, 


1 
2) _ = — Dy/(1,,2\D/2-1 
Ax(O.m?) = f dk ray = TU 5 )(gm*) 
= —gm’{2 + [-y+1—In(gm’)]} 
(see exercise VILA2.3b), using A~* = e~* 4), The y can be killed by using an MS or 
G scheme (see subsection VIIA3): At 1-loop order any version of those schemes has the 
effect of just canceling the y (but differences appear at 2 loops: see subsection VIIB8 


below). To include the jz dependence of the coupling, we just replace everywhere (see 


also subsection VIIA3) 
1.9 m* 
In(gm-*) — In (=) 
(and similarly for any momentum factors such as In($p”) that might appear more 
generally); effectively we are using units 4? = 1. Note that we are not allowed to 
Taylor expand in m: Doing so before integration would give an incorrect result; after 


integration it’s impossible. Similar remarks apply to the exponential e*** in A, if we 
interpret it as the definition by Fourier transformation of the propagator in position 


space. 


Exercise VIIB1.1 
Find the 2D massless propagator in position space by Fourier transformation. 
(But don’t Taylor expand in x.) Note that this Fourier transform is infinite, 


and requires “renormalization” (of a constant of integration). Compare this 
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with the result obtained by solving the integral form of the Klein-Gordon 
(Laplace) equation (i.e., Gauss’ law in D=2). 


Two-loop tadpole integrals are not much more difficult if one line is massive, or 
two are massive with the same mass. (Again, tadpoles with only massless lines can be 
taken to vanish in dimensional regularization.) If two propagators are massless, then 
they can be treated first as a one-loop propagator graph: By dimensional analysis, 
the result of that one-loop subintegral must be a power of the momentum squared. 
(The explicit result will be calculated in subsection VIB4.) We therefore consider 
more general one-loop tadpole integrals with more complicated propagators that may 
result from subintegrations in a higher-loop graph. For example, we consider 


(a) 
Ai(a,x,m? = fa a 
ia E+ map 
Using the definition of the ’ function, we can write 
I'(a) _ i. dr pa-le—T(k? +m?) /2 
[s(k2+m?)J@ Jo 


Performing the resultant Gaussian momentum integration and Taylor expanding in 
x, we easily find 


= 1( he 
(a,x, m* =o ala" (4m?)o- D/2—n 
n=0 


A more complicated example is 


7 — I(a) I(b) 
Alot) = fa eT rh 


ay dt,dt_ T a 1 qe tf dk [T1k? +72 (k?+m?)]/2 
0 


= drydt, Te 1 ro Mga ee ere 
0 


We then introduce a scaling parameter \ (also described in VITA2), scaling 7; = Aa; 


in the insertion 


ti an sX-n-n)= f dr d~16(1 — a4 — a2) 
0 


0 
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and integrating the 6 over a2 to get a2 = 1— a,. This gives (with a; = a) 


1 CO 
Ax(a, b, m?) -|/ da at t(1— a) | d) \ate-D/2-1,-A1—a)m? /2 
0 0 


P(a+b-2 
= Toni a) B(a, 2 —a) 
(Em?)e-D?2 
When two of the propagators in the two-loop tadpole graph have the same non- 


vanishing mass, we consider directly the two-loop integral 


Pa) P(b) P(c) 


Mite bogs = f aa pe ee 
tala Bom)= f dedhs Te kale Re + me) TE + my 


(This integral also represents the physically less interesting 2-loop “vacuum bubble”: 
no external lines, and thus field independent.) Introducing the Schwinger parameters 


and performing the momentum integration, we find 
| Br ee rots + 11(T2 + Pear 2 
0 


where we have included the power of det A for 


Ae (* am T1 ) 
T Ty 73 
Since T, does not appear in the exponential we integrate over it first directly, using 
the second integral form for the Beta function, from exercise VITA2.2c. Then 7, and 
T3 can be handled by introducing a scaling parameter for them only, leading to the 
previous types of integrals. The result is then 


rat+b+6e—D) 
(gm2 jeer 


Ax2(a, b,c, mm") = B(a+b-,a+c—9)B(a,2 ~a) 


2. Effective potential 


A propagator in an external field represents a certain class of Feynman tree dia- 
grams. Thus, some tree graphs can be described by quantum mechanics. (In principle 
this means we can start from classical mechanics and first-quantize, by either operator 
or path-integral methods. However, as we’ll see in chapter XII, in practice we save 
some effort if we start directly with the quantum mechanics.) If we take the ends 
of such a propagator and sew them together, we can describe arbitrary 1PI 1-loop 
graphs by the background field method. While tree graphs describe classical field 
theory, one-loop graphs contain many of the important quantum properties, partly 


because they are the lowest-order quantum correction, and partly because they are 
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associated with the functional determinant part of the (second-quantized) path inte- 
gral. (In terms of the exponent, classical is the only negative power in h, 1-loop is 


h-independent, and higher loops are positive powers.) 


In quantum mechanics, the expansion in f is an expansion in derivatives (since 
it appears only as pz = —ihO,). In terms of the contribution of one-loop graphs to 
the effective action, this means an expansion in the number of derivatives acting on 
the fields. This definition can be applied in general in quantum field theory, without 
reference to quantum mechanics. However, the simplest one-loop calculations of this 
expansion are most easily expressed in quantum mechanical terms. In practice, this 
means expanding the external fields in 2 about some fixed point, expanding the 
exponentiated (by a Schwinger parameter) propagator about the part Gaussian in p 
and x, and using any of the usual methods to exactly evaluate the matrix element of 


a polynomial times a Gaussian. 


Since we generally want arbitrary orders in a field and some of its lower derivatives 
for this method to have any advantage over the usual diagrammatic methods, in this 
approach one generally cuts off the expansion at the approximation that gives just 
the Gaussian. This means we can keep up to two derivatives of an external scalar, but 
only a constant field strength for an external gauge vector. (See subsection VIB1.) 
The simplest, and most useful, example is a constant scalar field. The part of the 
effective action that consists of only scalars without derivatives is called the “effective 
potential” , since it generalizes the potential term of the classical action. This potential 
determines the quantum corrections to spontaneous symmetry breaking and the Higgs 


effect, and this is important for describing mass generation for all spins. 


Consider a complex scalar running around a loop, under the influence of an ex- 


ternal real scalar. The Lagrangian is 
L=p*|3(-O+ m*) + lb + Le 


where the form of Lg won’t be important for calculating the ~ loop. A constant 


external scalar field is effectively the same as a mass term, modifying m? — m? +4 2¢. 


Thus the effective potential in this case can be evaluated by summing tadpoles: 
V=—Sou-ae fap (ye? + my 
n=1 
for our complex scalar; for a real scalar running around the loop there would be an 


extra factor of 1/2. We can integrate before summing: 


——— Perr. ss z) (4m2)-"+P/2 


n=1 
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Using the identities (from Taylor expansion in a/b, and (z+ 1) = zI(z)) 


(a+b => (‘) cia (') ~ aera) - Waray 


n=0 
we have 
V = PCB) [(gm? + 6)? — (hm)? 


We can also integrate after summing: Using the identities 


° d . nm np—-n 
infa+b)—Ind= f —- fem geen "(5 = —S +(-1)"a"b 


ie 2 = [aw f dp ere) -f- dr (eer ne) 
o U T 


we have 
Ve -f- dr = [av ( en Tlb+(p?+m?)/2] err enty/2) 


which gives the same result. 


For D=4, we find (after subtracting divergent counterterms, and some correspond- 


ing finite pieces, corresponding to a MOM type of subtraction) 


V = 3(gm? + ¢)7ln (1 + =) 
Since this modifies the classical potential, it demonstrates that quantum effects can 
generate spontaneous symmetry breaking where there was none classically, or vice 
versa (the “Coleman-Weinberg mechanism” ). 


Exercise VIIB2.1 


Generalize this renormalized result to arbitrary even dimensions. 


For more complicated cases we need a more general procedure: The basic idea 
is that any Gaussian integral gives a (inverse) determinant, of which we must take 
(minus) the logarithm for the effective action, and we use In det = tr In. (The trace 
includes integration over x or p.) After subtracting out the field-independent part 
(vacuum bubble), this gives an expression as above: For a general kinetic operator 
H = Ho +... (generally Hp = $(p? + m?)), we want 


d 
T= — [tr In(H“") - tr n(A --f- a = fax ( (gle “7 ~e-""|z) 


H (and Ho) is now treated as an operator, in terms of the coordinate operator X 
and momentum operator P, and X|x) = z|x). External fields depend on X, but are 


Taylor expanded about «: e.g., 


P(X) = bx) + (X — x) - 06x) + 
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We then can use translation invariance to write 


(ale- THIF. X—2,6(@)]| = (Oje-THIP-.9@)]/Q) 


When 4 is quadratic in P and X, we can use (see exercise VA2.5) 


ale" ly) = | det 2 


where S' is the classical “action” corresponding to the “Hamiltonian” H. (Further 


examples will be given in subsection VIIIB1.) 


Note that the (one-loop) vacuum bubble, with no background fields of any kind, 
must always be dropped, as it is totally meaningless (although how it is subtracted 
may be regularization dependent): In terms of the graphs summed here, which have 
equal numbers n of propagators and vertices (P—V = L—1 by the usual h counting), 
it is the term n=0. Thus, in a coordinate space calculation, where there are also n 
integrations d?x, this term would have no propagators, no vertices, and no integrals 
(contrary to some statements in the literature, where this graph is misidentified as 
a one-propagator graph with one integration). All that remains is the permutation 
factor, 1/n, but in this case that is an undefined 1/0. 


In actual applications, closer examination reveals the used graph to be the cut 1- 
loop tadpole (P = V = L = 1). Since the cut propagator gives a sum over states, the 
result is to evaluate the trace of the operator inserted at the vertex; in particular, a 
trivial vertex yields str(I), i.e., the number of states, bosons minus fermions. Similar 
use can be made of the cut propagator correction (P = V = 2) for (super)traces of 


operator products or mass sum rules. 


3. Dimensional transmutation 


The 2D version of the CP(n) model described in subsection IVA2 is an interesting 
model in that it demonstrates generation of bound states at the one-loop level. Its 
Lagrangian is: 

L = V6)? + A(lol? — 5) 


where g is now dimensionless. For the effective potential for the Lagrange multiplier 


A from a @ loop, we find (modifying the calculation of the previous subsection for 


D=2) | 
vi= A In (2) “1 
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after including the renormalization mass scale yz to make the argument of the loga- 
rithm dimensionless, and the coupling dimensionless in all dimensions. Note that we 
have effectively added a mass as an infrared regulator, then taken it to 0 at the end. 
Equivalently, we treat A itself as a mass, as mentioned in the previous section. We 
would miss this with dimensional regularization, which is OK for ultraviolet diver- 


gences, but has some difficulty with infrared divergences, because of their nonlocality. 


Now the coupling can be absorbed into the definition of this scale: Adding to the 


classical term Vj = —A/g?, the total effective potential for A up to one loop is 


vale (Ge) --alo(Gin)~ 


where M is the “renormalization group invariant mass scale”: 


M2 — pe V9 


Since this was the only place the coupling g appeared in the action, the mass scale MW 
has now replaced it completely. This replacement of a dimensionless coupling (g) with 
a dimensionful one (M) is called “dimensional transmutation”. It is also a common 
feature of quantum high-energy behavior (see below); its importance at low energies 
depends on whether the classical theory already has dimensionful parameters (like 


masses). 


Varying the effective potential to find the minimum, which we identify as the 
(quantum) vacuum value of the field A, 


i \ ao = ii 
(si) _— 


Because A has a vacuum value, @ now has a mass (as seen by expanding A about 
its vacuum value). Furthermore, since A now has more than just linear terms in 
the effective action, it is no longer a Lagrange multiplier. In fact, by calculating a 
massive @ loop with two external A’s, we see that A is now a massive physical scalar 
also. Without a Lagrange multiplier, ¢ is now unconstrained, so it has an additional 
physical degree of freedom. This leads to a restoration of the spontaneously broken 
U(N) symmetry. This is related to ¢ gaining mass, since we no longer have Goldstone 
bosons associated with the symmetry breaking. Finally, if we calculate a massive ¢ 
loop with two external gauge vectors, we see that at low energies there is an F? term, 


so A is now a physical, massive vector instead of an auxiliary field. 


Exercise VIIB3.1 
Expand V about (A) to show that A gets a mass term. Expand A(z) to 
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quadratic order in x according to the prescription of the previous subsection to 
calculate the effective action in terms of A, 0A, and 00A (using the harmonic 
oscillator result of exercise VA2.5) to show that a ADA term is also generated, 


so A becomes propagating. 


4. Massless propagators 


For the massless one-loop propagator corrections, we also introduce a scaling 
parameter to convert to Feynman parameters (see the examples of subsection VIIB1, 


or the general method in subsection VI[A2), with the result 
1 
3(k + 3p)?3(k — 3p)? 
= [ CO mea [ dayday 6(1 — ay — ag) x 
0 0 


x exp{—Azp"(1 — (a, — a2)"| — is (ay —a)p-x— 7" 527} 


A2(x, p”) = | ak eee 


Making the change of variables 


a =3(1+6), a= 5(1-8) 


the amplitude takes the form 


1 00 
Az = | dp 4 (etP2/2 ~ aay) dX M~PPeap[—Ad(1 — 8?)p? — \** $27] 
0 0 

The integrals can be simplified if we make use of gauge invariance: For example, 
the electromagnetic current for a complex scalar is of the form ¢* de, so the gauge 
field couples to the difference of the momenta of the two scalar lines, which is 2h for 
the above as applied to the scalar-loop correction to the photon propagator. On the 
other hand gauge invariance, or equivalently current conservation, says that such a 
vertex factor should give a vanishing contribution when contracted with the external 


momentum, which is p in that case. Checking this explicitly, we do in fact find 


k-p / 1 1 
———— Se ea ee 
/ a(k + 5p)?5(k — ap)? x(k — 3p)? x(k + 3p)? 
(even with an arbitrary additional polynomial factor in the numerator), using the 


facts that the integral of the sum is the sum of the integrals when regularized, and 
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that massless tadpoles vanish. (This also tells us that replacing the numerator k - p 
with k? + <p gives 0. Furthermore, without an extra numerator factor the integral 
vanishes by antisymmetry under k — —k.) Thus, if x is proportional to p in Ag, 
the only contribution is from the x = 0 term in the Taylor expansion. Since then 
p- (O/0x)A2g = 0, it depends on only the “transverse” part of x. This implies that 


the dependence on x is only through the combination 
= (p- a)’ — pra’ 
so we can evaluate the integral by either of the substitutions 
30, produ or p:-2—70, 2? —-u/p’ 


We'll consider now the latter choice. (The former gives the same result: See the 
exercise below.) Again, since we need to Taylor expand in x anyway to find the result 


for a particular numerator, we expand and perform the A integration: 
lee) n 1 
u n - n - 
As _ > + (=) (4p*) +D/2 ue = DB _ n) [ dB (1 _ Bp") +D/2-2 
n=0 0 
Performing the change of variables 3? = y to convert the remaining integral to a Beta 
function, and using the identities 


(3) =v, I'(z)P(1 — z) = 7 esc(1z) 


(see the exercises in subsection VIIA2), the final result is 


oe) 


A» _ —tn*ese(DE)( t p° aa 2 : 
ae. +2- ) 


From the csc factor we see the integral is divergent for all even D: These are ultraviolet 


{gle = = (pa) |)" 


n=0 


divergences for D > 4 and infrared ones for D < 4; dimensional regularization does 
not carefully distinguish between the two, although the difference can usually be told 
by examining momentum dependence (here from the exponent D/2— 2). Also notice 


that the two can be mixed up by the conversion to Feynman parameters. 
Exercise VIIB4.1 
Evaluate the general massless one-loop propagator correction using x? — 0, 
pa Ju. 
a Show it gives the same result as p- 2 — 0, x? — —u/p? by using the I and 
B identities in subsection VITA2. 
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b Show it can also be written as (for convenience of expansion about D=4) 


r(n+ 2-1) 
Ay = (Ap?) P/2-2 p(B Pr D . Z 
2 = (9p") (y- DL 2, nil (2n + D — 2) 


(ies ipa 

As discussed in subsection VIIB1, sometimes certain subdiagrams of higher-loop 
diagrams can be evaluated explicitly, particularly propagator corrections that them- 
selves involve only massless propagators. Furthermore, such a formula might be used 
recursively in appropriate diagrams. For example, a higher-loop diagram that is itself 


a propagator correction might reduce, as a final integration, to something of the form 


Fa) _F)__Pa+b-z) a0 4p 
fa Ge Het ppp Gpyaee Ba 29) 


again using the above methods, finding similar integrals to the previous. 
Exercise VIIB4.2 


Let’s examine this integral more carefully. 


a Evaluate it in two different ways: first, by the method used above; second, by 


Fourier transforming each factor using 


ikea Pa) its =a) 
joo gh- fee 


(Ek2)2— ( 172)D/2—a 
(derive this also) and its inverse, simply multiplying the resulting factors in 
x space, and inverse transforming. 


b Show that the MS scheme cancels y’s and ¢(2)’s in iterated massless propa- 


gator corrections to all orders in € by examining 


1 
1 ——— 
r( 2 ) / (Sk2)mtlhie(s(k lL. p)?|n2+L2e 


where L; are the numbers of loops in the propagator subgraphs (show this by 
dimensional analysis) and n; are other integers. Show the G scheme does the 


same. 


Exercise VIIB4.3 


Calculate the “phase space” for n massless particles 


= d?-1p; )P/25D 
ve | Magre, (2n) 2/25 ( -¥) 


where p is the total momentum of the n particles, by using the optical theorem: 
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a Consider the scalar graph with n massless propagators connecting 2 vertices. 
Show, both by induction in the number (n— 1) of loops, and by Fourier trans- 
formation (as in the previous problem), that this graph (for distinguishable 


particles) gives 


BOR ee 
Pin(2 — 1) pyre 


b Wick rotate back to Minkowski space (p? < 0) and take (twice) the imaginary 


part to obtain the result for continuous real D > 2 


ee ek ie ad 
Tin(2 — Dr [(n—- (2 - Wh 3D’) 


(Hint: (3p? = ie)” = (—3p*)"e*".) 


5. Bosonization 


A common method in field theory is to consider simpler models where calculations 
are easier, and see if they are analogous enough to give some insight. In particular, 
two-dimensional models sometimes have perturbative features that are expected only 
nonperturbatively in four dimensions: For example, we saw in subsection VIIB3 
the generation of bound states at one loop in the 2D CP(n) model. Of course, 
some of the features may be misleadingly simple, and may have no analog in D=4. 
Two-dimensional theories, especially free, massless ones, are also useful to describe 
the quantum mechanics of the worldsheet in string theory (see chapter XI). In this 
subsection we consider free, massless 2D theories: Essentially, this means just the 
scalar and the spinor, since there are no transverse dimensions to give gauge fields 
nontrivial components. 

Spinor notation is very simple in D=2, since the Lorentz group is SO(1,1)=GL(1). 
For that purpose it’s convenient to use lightcone notation. 2D y matrices can be 


chosen as 


w=) HHS) MAH T= OP) =vin 
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In general, even-D y matrices can be constructed as direct products of D/2 sets of 
2D 7 matrices, so tr(I) = 2/?. (For details, see subsection XC1.) 


The Lagrangian for a massless, complex spinor can be written this way as 


L = Wigv = bo(—-t0e0) Pe + Po(—t0ee)b 


(This also follows from truncation of 4D spinor notation.) Note that ws and Wo 
transform independently under proper Lorentz transformations, as do their real and 
imaginary parts. Thus, we can not only impose a reality condition, but also a chirality 
condition, dropping Ws or We: A single real component is enough to not only define 


a spinor Lorentz representation, but also construct an action. 


Upon Wick rotation to Euclidean space, the lightcone coordinates become com- 
plex conjugates of each other. These complex coordinates are convenient because they 
are still null coordinates, and their derivatives occur separately in massless fermion 
kinetic operators (and Lis just their product). For later application to string theory 


it will prove convenient to avoid some /2’s, and define 


229° 24x, z= 9° = 7g! 
= da dz dz 
1 = ——— 1 2 = ) A 
=> 5X] = 200, ae eae 0°(o) = 216(z)d(Z) 


where 0 = 0/0z and 0 = 0/02. (The sign for dz dz depends as usual on order of 


integration. ) 


The action for a real scalar is then 


_ dz dz _aige, 
s= | Oni L, L = 539(-00)¢ 


For a chiral spinor we then use either of 
L=wdw or wow 


where the 7 from the usual energy operator —i0p has been absorbed into the normal- 
ization of the fermions for later convenience. (Reality is funny anyway in Euclidean 


space: 0 vs. 0; see section XIB.) 


In Euclidean position space, the propagator for a massless scalar is (see exercise 
VIIB1.1) 


aa ~ #') = —Inf(a — a’)"] = —Inf(z — 2/2 — 7) 
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up to areal, dimensionful constant: We use units = 1. The propagators for massless 
spinors are then 

1 
z—2! 

1 


ZZ 


52mis(2 =-£62-7)=8 Wie -HE—7= 


1 : ! = cI) _ dy IN (Zs = \) — 
gemid(z — 2)0(Z — 2) = Inl(z — z)(2— 2) = 


/ 


We then find 


pe 6" Sadie) 
z 


z 


The apparent inconsistency of this result is resolved by noting the € prescription 


for the Euclidean spinor propagator: If we regularize 
In(zzZ) — In(zz+.e) 


for any € that is not 0 nor negative (i.e., is positive or complex), we find 


i) | 


— — — nny 
z P-pele 
and the wave equation for either propagator is satisfied, where the place of the 6 


function is taken by 


2€ 
275?(c) — (+6? 


whose normalization is easily checked. 


Exercise VIIB5.1 
Show (e.g., by an infinitesimal Wick rotation) that the correct ie prescription 


for the spinor propagator in Minkowski space is 


= = =i 


(x — x’)* — iee(t — t’) a ne 2, 


Gueron ““Gograk 


and that it satisfies the wave equation. (t — ¢’ can be replaced with (x — x’) 


in the above.) 


As a simple example we consider the 2D phenomenon of “bosonization/fermioni- 
zation”, that fermions and bosons can be converted into each other, even when they 
are free. First we examine bosonic currents created from fermions: Taking the product 


of 2 such currents inside the functional integral (as usual, time ordering is implicit), 


1 
zZ— 2! 


J=iby => J(z)J(2)2- ( ) = 00'[-In(|z — z'|?)] © J(z)J (7), J = 00 


oP 


where “” means we look only at the most singular terms as 2’ — z, from using all 


these fields to generate propagators. From this we see that the fermionic current J = 
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ipw (the i is from Wick rotation) has the same “propagator” as the bosonic current 
J = 0¢ (actually just the complete, chiral part of a boson). Thus, by integration we 


can define a “chiral boson” in terms of fermions. 


By being a little more tricky we can do the reverse, define the fermion in terms 
of a boson. Unlike the previous procedure, this would seem to violate statistics, and 
has no classical analog. We start by separating the scalar on shell into its “chiral” 


and “antichiral” parts (which were left- and right-propagating in Minkowski space): 


since 00¢(z,Z) = 0. (This can be accomplished by differentiating with respect to z or 
Z and then integrating back. We use “@” to represent either the full boson or its chiral 
part, which should be unambiguous by context.) The chiral boson has propagator 


(the chiral half of an ordinary boson’s) —In(z — 2’). 


The inverse relation is (quantum mechanically, not classically) 
Bae¥, yal 


You might think a = 1, but this product is singular, as we would expect if we are 


to get the correct propagator. First, we find 
b(z)ei?@) = : [b(z) — i In(z — z)]e*® : 


where this expression is exact, and we have used an explicit normal-ordering symbol 
(14 


: :” to indicate we have already evaluated all propagator terms, even though we still 
have fields at different points. Similarly, 


[o(z) |e) = : [6(z) —é In(z — 2’) : 


= eH) eib2) = . e-tb(2)—Ine-7') 619’) - = 1 » eb) +i9(2’) 
z—2! 


Taylor expanding both ¢(z’) about z, and the exponential, we find in the short 
distance limit / 
: —ib(z) ,ib(2') _ . 
lim €719@) #6) — , — 109(z) 


zl —z Z = 
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The nonsingular, i.e., ¢-normal-ordered (not the same as ¢-normal-ordered), term 
agrees with the current defined above. Note that, in terms of 2D quantum field 
theory, this calculation is a sum of diagrams to an arbitrary number of loops! Thus 
w and ¢ are unrelated classically, and their quantum relation is another example of 
duality: If we stick in the h from the ¢ action as a factor in the ¢@ propagator, the w 
propagator looks like (z — z’)~"*, whose expansion reproduces the powers of In, and 


which becomes 1 in the ¢-classical limit. 


Although this gives the appearance of a scalar being the bound state of spinors, 
and vice versa, even in the free theory, there is a simpler interpretation, even clas- 
sically: Massless particles in D=2 travel at the speed of light in one of two possible 
directions. Thus, a collection of free “left-(or right-)handed” massless particles travels 
along together, not separating, and thus acting like a bound state. (As shown in sub- 
section VC8, singularities in perturbative quantum field theory directly correspond 


to configurations in classical mechanics.) 


A related calculation is for the energy-momentum tensor of the fermions, which 
we'll apply in section XIB to conformal transformations: By taking again the boso- 
nized form of the point-split operator product above, and taking derivatives before 
the short-distance limit, we find 

. = > 1 
iim 510 (z)O'Y(z') — Oo(z) H(z’) = @=ae + 3(06)° 
relating the tensors for bosons and fermions (after ~-normal-ordering away the sin- 
gular term). 


Bosonization extends to massive fermions: The “massive Thirring model” 


L = $e(—i0e0)e + Yo(—i0g@)be + 5(Vebe + Bote) + veloteve 
(in Minkowski space) is equivalent to the “sine-Gordon model” 
L = gal7z(0¢)* + gu"(1 — cos 4)| 


with the above relation between the spinor and scalar fields, and 


1 fi 

(Note in particular the free massive fermion for 3 = 1.) In this case the bound states 
are dynamical. Note that the relation is between strong coupling in one theory and 
weak in the other (“duality”). 
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6. Massive propagators 


Another way to distinguish infrared divergences is by introducing masses (being 
careful not to break any invariances, or restoring them in the massless limit). For 
example, we again evaluate the one-loop propagator correction, without numerator 
factors, but with different masses on the internal propagators. By the same steps as 
before, the Feynman parameter integral is 

Ay (p°, mi, m3) = ja ————————— 
3l(k + 5p)? + mi]5[(k — xp)? + m3] 


1 
=r(2- 2)! / dg BPP, B= 4pP(1— B) + Ami +m) + 490m? — mi) 


Now the ( integral is harder for all D, but the masses eliminate the IR divergences 
(and the UV divergences are already explicit in the I’), so we immediately expand 
about D = 4 — 2e: 


; 1 
A, = r(94 | d3 (1—eln B) 
=i 


We then use integration by parts 


1 
[ ma=omeByp,-f asdne 
-1 


dp 
—b+Vb?—4ac m?—m2+2d 
B= a9? +d8-+e=a(3—8,)(8-B.), B= = Me 
B B G4. B- 
= Bin B = =2+4 + 
dG B-B,  B-B- B-B, B-p 
in terms of Aj2(s) of subsection IA4 for s = —p?. Note that in Euclidean space 
2Ar2 = / (p? + mZ — m5)? + Amp? = 4/ (p? + m3 — m7)? + 4mip? > p? + |myp — m3 


= 26.21 


where the strict inequality holds for both masses nonvanishing. The integrals then 


take the simple form 


i OB i: C= y =) 
[#8 Cae ed a meer fc a 


Putting it all together, 


A, =I(1+6) E —In(mymg) + 2+ 584ln (5) + 38_In (> . 7) 


(We can cancel the [’(1 + €) by nonminimal subtraction.) By analytic continuation 


from Euclidean space, taking p? from positive to negative along the real axis, we see 
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there is no ambiguity at p? = 0 or —(m; — m2)”, and A, remains real until we reach 
p? = —(m, + m2)”, where it gets an imaginary part (whose sign is determined by 
(m, + mz)? > (m, + mz)? — ie), corresponding to the possibility of real 2-particle 


intermediate states. 


Exercise VIIB6.1 


Let’s consider some special cases: 
a Show for equal masses m, = m2 = m that this result simplifies to 


Aa(p?,m?,m?) = P(1 + €) E —In($m?) +2+ 6 In (Fa) 


B+1 
p? + 4m? 
ia aoe a 


b Consider the case with one internal particle massless, m,; = m, mz = 0, and 
find 
Sn 1 2 af m2 2 ae m? 
Ax(p?,m*,0) = I'(1 +e) - —In(hm?) +2- in — 
€ D m 
c Show both these results agree with the previously obtained massless result in 
the limit m — 0. However, note that both these cases, unlike the massless 


case, are IR convergent at p? = —(m, + m2)’. 


Exercise VIIB6.2 
Find the phase space for 2 massive particles, again using the optical theorem 
(as in exercise VIIB4.3): 


a The calculation is easier if one takes the imaginary part before performing 
the Feynman parameter integration: Show the result is then 


= 1 = D/2-2 
Verargay f, eB 


In particular, show from the explicit parameter integral expression for the 


2 


propagator that the only cut is at —p? > (m , + mz)’, as expected from the 


optical theorem. 


b Make the change of variables 


aa Lat 
p= 8, 
to find the result 
r(3-1) = 7 
Ye ee BO 
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which simplifies in D=4 to 
A12 


1,2 


Ve = 29 


c Show this result (in all D) agrees with the result of the explicit phase space 
integral of subsection VC7, using the expression for f d?~?.Q from subsection 
VIA3. (Hint: Use the identity from exercise VIIA2.2b.) 


d Show the massless case agrees with exercise VIIB4.3. 


In subsection VIIA3 we considered the application of the MOM subtraction 
scheme to propagator corrections. We assumed the propagator corrections were 'Tay- 
lor expandable in the classical kinetic operator. From the above explicit expression 
for the 1-loop correction in scalar theories, we see this is possible except near the 
branch point at p? = —(m, + mz)’, ie., when the external particle (whose propaga- 
tor we’re correcting) has a mass equal to the sum of the internal ones. To analyze 
this more carefully, let’s recalculate the propagator correction, performing the Taylor 
expansion before evaluating the integrals. We consider the case with one vanishing 
mass, Mm, = Mm, M2 = 0, to generate an IR divergence. Assuming the external mass 
is also m, we expand around the branch point in p? + m?. The Feynman parameter 


integral is then, to linear order in p? + m?, in terms of a = s(1 + (), 


l-ap?+m?\] 
m2 


1 
A2(p?,m?, 0) = ref da [pmta’ (1 + 


1 1 1 1 p+m 
— 1. 2D\ 6 
me CLs, -—-(S-s) we | 


= 1 1 ptm 
rw (1 + €)(3m?) [(— + 2) +5 (— + 2) a) 
where we have distinguished the UV divergence (in the \ integral for « > 0) from the 
IR one (in the a integral for « < 0). After including the (5y*)* in the coupling, the 
($m?)~© converts each 1/e into a 1/e — In(m?/j:?). (Of course, we can choose pp = m 
for convenience.) Note that this infrared divergence was a consequence of trying to 
Taylor expand about a branch point due to a massless particle. 
Exercise VIIB6.3 
Do MOM subtraction for external mass M = m, + mg, with neither internal 
mass vanishing, and show there is no divergence other than the UV divergence 


of the minimal scheme. 
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Later, we will encounter propagator corrections in gauge theories with massive 
internal lines, and with various numerators. Here, we examine these purely from the 


point of view of the integrals. First, consider 


k; 
Ag = / dk Teo ip awe ipa 
a[(k — gp)? + mi]al(k + 5p)? + m3] 
Since p is the only external momentum for a propagator, by Lorentz invariance we 


have 1 
Aa = Pa—3P 7 A 
P 


so it is sufficient to evaluate the integral of p- A. In analogy with the earlier massless 
expression, we look at 
1 1 _ k-p+4(m3 — m7) 
ak gp)? + mi] S[k + gp)? + m3] Bk — gp)? + milal(k + gp)? + m3] 
from which we find 
2 


—m x a 
Aa = Pa—s > [Aa(p’, mi, m3) ~~ A2(0, mi, m3)| 


in terms of our result we above for the integral without numerator. 
As a more complicated (but important) example, we examine 
Rake 
Ac = / 0 ee 
al(k + 3p)? + m?] 3[(k — 3p)? + m?| 


Following our procedure of the previous example, we note 


(p-k)k 
[a Te + Ep)? + ms[(e — Ep)? + mA 


7 [a k; k 
HUE Boy? om] HCE Spy 
k+t k—i 
= favs Zs ~ T7p2 a =p | ak 5 = 2 
3(k2 +m?) — 5(k? +m?) 3(k? + m?) 
Thus transversality again determines the amplitude in terms of a scalar: 
4 kaky Tab 
Ag = pa eH  — __ 
(E+ gp)? + m?]g[(k = yp)? +m] (RP +?) 
= (NavP” — Papo) A(p?, m?) 
(This amplitude actually will be more useful than A,,.) We also have the identity 


a(k? +m") + 3p" 
[a Tie + Ip) + me - Ip? am 
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1 1 1 
col dk et og = | hoo 
tf oh reat HR? +m?) 
Taking the trace of the previous expression, 
‘ Ke D 
D—1)p2Alr? = fe oo 
( )p° A(p*) L(k + 4p)? + m2]i[(k — ip)? +m]  4(k2 +m?) 


1.2 2 1 = = _ i 
=n!) fa am 9 aa 


= —($p" + m?) Ao(p”, m, m?) ~~ (D ~~ 2)Ai(0, m?) 


in terms of the ¢* propagator and tadpole graphs evaluated earlier. This result can 


be reorganized if we make use of the p = 0 case: 
0 = —m?A,(0, m?, m?) — (D — 2).A;(0,m?) 


(which also follows easily from the earlier explicit expression for A,(a,0,m?)). We 
then find 
»Ao(p?, ee m?) _ A,(0, a, m?) 


A = -—7 An(p”, m?, m?) — am P 


4(D—1) T 
Exercise VIIB6.4 
Check that these results are consistent in the massless limit with the expres- 
sions obtained in the previous subsection, by relating the first two terms in 
A(x, p”) for arbitrary D. 


Exercise VIIB6.5 
Calculate the one-loop propagator corrections for A and A in the 2D CP(n) 


model. 


7. Renormalization group 


An interesting, useful, and simple application of the propagator correction is to 
study the high-energy behavior of coupling constants. For example, we have seen that, 
by a change in normalization of gauge fields A — A/g, gauge couplings can be moved 
from the covariant derivative to the kinetic term: V = 0+igA — 0+iA, Lo = }(OA+ 
igAA)? = a7(OA +iAA)*. Thus, quantum corrections to gauge couplings can be 
found from just the propagator (kinetic-operator) correction. A simpler example is a 
scalar field; a ¢* self-interaction has a dimensionless coupling in D=4, like Yang-Mills. 
However, unlike Yang-Mills, this model has no cubic coupling, and thus no 1-loop 


propagator correction. Furthermore, in Yang-Mills the one-loop propagator correction 
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contribution to the effective action gives a multiloop contribution to the propagator 
itself, from the expansion of 1/( + A). This corresponds to the graph consisting of 
a long string of these corrections connected by free propagators. There is a 1-loop 
4-point correction in ¢* theory, and this graph resembles a propagator correction, 
but with two external lines at each vertex instead of one. Such corrections can also 
be strung together, resembling the Yang-Mills string, but with no free propagators 
inserted. Since all the intermediate states in this graph are 2-particle, it is 1PI, so the 
effect of this string is not contained in just the 1-loop effective action, even though it 


is an iteration of a 1-loop effect. 


OX + >-O-0-0* 


This difficulty can be avoided by introducing the * interaction through an aux- 


iliary field, just as it appears in supersymmetric theories (see subsection IVC2): 


LD 


3¢(-30 + x)b — xx" 


where we have neglected the mass term since we will be concentrating on the high- 
energy behavior. Here the coupling is introduced through the auailiary-field “kinetic” 
term. The diagrams just discussed now appear through the 1-loop correction to the 
auxiliary-field propagator: Since its free propagator is just a constant, it can be 
contracted to a point in these multiloop diagrams. The definition of 1PI graphs 
has now changed, since we can now cut auxiliary-field propagators, which would not 
exist in the usual $* form of the action. This modification of the effective action 
simplifies the analysis of quantum corrections to the coupling, as well as making it 
more analogous to gauge theories. Note in particular the change in interpretation 
already at the tree level: We have used the conventional normalization of 1/n! for 
factors of 6” in the potential, since canceling factors of n! arise upon functional 
differentiation. However, the result of eliminating y from the classical action produces 
=" instead of a". The reason is that in the diagrams with y there are 3 graphs 
contributing to the 4-¢-point tree, corresponding to x propagators in the s, t, and u 
“channels”. (See subsection VC4.) Although this is a trivial distinction for the trees, 
this is not the case for the loops, where the propagator string consists of pairs of ¢ 


particles running in one of these three channels. 


The contribution to the 1-loop effective action for x is then given by the above 


calculations, after including the factors of 1/2 for symmetries of the internal and 
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external lines, and the usual —1 for the effective action: 
L=—4X [Fe Bu=e1—e(=,0) “|x 


where as usual « = 2 — D/2. Expressing the Beta function in terms of the Gamma 


function, and expanding as in previous subsections, 


Renormalizing away the constant pieces, we find for the classical action plus this part 


of the 1-loop effective action 
L+L = 30(-40+ x) — dx | - dln (—3)] x = $0(-$0 + x) + by In (-) x 
where the renormalization group invariant mass scale M is given by 

M? = p202/9 


Thus, the constant coupling 1/g has been replaced by an effective “running coupling” 
—41n(p?/M?), with energy dependence set by the scale M. (This is sometimes called 
the “renormalization group”, the group being related to scale invariance, which is 


broken by the introduction of the mass scale M.) 


We saw the same dimensional transmutation occuring in the effective potential 
in massless theories in subsection VIIB3. The form is similar because both are re- 
lated to the appearance of the renormalization mass scale ys from the breaking of 
scale invariance by quantum corrections, at either low or high energy: In both cases 
dimensional transmutation comes from a finite In yu? term arising from the infinite 
renormalization. The difference is that in the effective potential case we ignore higher 
derivatives, so the ? must appear in a ratio to scalar fields, while in the high energy 
case we look at just the propagator correction, so it appears in the combination p:?/p’. 


(More complicated combinations will appear in more general amplitudes. ) 


Exercise VIIB7.1 
Generalize this model to include internal symmetry: Write an analog to the 
scalar analog to QCD discussed in subsection VC9, where the “quark” @ now 
carries color and flavor indices, while the “gluon” y (classically auxiliary) 
carries just color. Find M, especially its dependence on the numbers n of 
colors and m of flavors. Write the same model with the gluons replaced 
by “mesons” carrying just flavor indices (so that classical elimination of the 
auxiliary fields yields the same action), and repeat the calculation. What are 


the different approximation schemes relevant to the two approaches? 
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8. Overlapping divergences 


We now perform some 2-loop renormalizations. Our first example is part of the 
propagator correction in ¢* theory. By restricting ourselves to the mass renormal- 
ization (coefficient of the mass term), we need evaluate the graph only at vanishing 
external momentum. (It is then equivalent to a vacuum bubble in ¢? theory, or a 
tadpole graph in the mixed theory.) Furthermore, we consider the case where some of 
the fields are massless. In such a theory, we encounter (a special case of) the 2-loop 
graph of subsection VIIB1, where 1 propagator is massless and 2 are massive. Ex- 
panding in e, and keeping only the divergent terms (1/e? and 1/e), we find (including 
a symmetry factor of 1/2 for the 2 massive scalar lines for real scalars) 


(3—D 


LD D D D 


x tm?[I"(e)|?[-1 — 3e + 2€ In($m”)] 


To this we need to add the counterterm graph, coming from inserting into the 
1-loop massive tadpole J; (with 2 external lines) the counterterm A, (for renormaliz- 
ing the ¢* term) from the 1-loop divergence in the 4-point graph with 1 massive and 
1 massless propagator. (Since the massless tadpole vanishes in dimensional regular- 
ization, we need not consider the counterterm from the 4-point graph with 2 massive 
propagators.) From section VIIB6, we use the corresponding integral for a 1-loop 


propagator correction A, which is 
A=T(e)+finite => A,s=—-I(e) 


We use a “modified minimal subtraction”, using the I’(e) as the subtraction instead 
of just the 1/e part of '(e) ¥ 1/e—7. 
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The 1-loop massive tadpole without coupling is 


7 = ities) = ere) we —5m’I(c)[1 +e —€ In(Zm’)| 
(may F 


Combining these results, the divergent part of the 2-loop propagator correction, 


with 1-loop coupling counterterm contributions included, is 
Ty + AsT, = 4m? [I"(€)?[-1 — 3¢ + 2e In($m?7) + 2 + 2e — 2 In(Zm’)| 


=[Fe\rilso gm 


Thus, the In m? divergences cancel, as expected. (Divergences must be polynomial in 
masses as well as couplings.) The surviving divergence is the superficial divergence, 


to be canceled by the 2-loop mass counterterm. 


Exercise VIIB8.1 
Calculate the p? part of the 2-loop kinetic counterterm by writing the above 
2-loop propagator graph with nonvanishing external momentum, introducing 
the Schwinger parameters, doing the loop-momentum integration, taking the 
derivative with respect to p?, and then evaluating at p = 0. Why is there no 
subdivergence (1/e*)? 

Exercise VIIB8.2 
Calculate the complete (all graphs, infinite and finite parts of the) 2-loop 


propagator correction for massless $4. (See exercise VIIB4.3a.) 


For our next example we consider massless ¢° theory, and work in 6 dimensions, 
where the theory is renormalizable (instead of superrenormalizable, as in 4 dimen- 
sions). For the 2-loop propagator correction, there are only two graphs (plus 1-loop 
graphs with 1-loop counterterm insertions), one of which is simply a 1-loop propagator 


graph inserted into another. 


OO 
OO 
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The other graph is 


1 
P= i dk dg. TS 
HR+ a)? HR + Bp? HR — gp) 40+ gp) a0 — BP 
(with a symmetry factor of 3 for real scalars). This graph can be rewritten as iterated 
propagator corrections by use of integration by parts in momentum space. This is 
legalized by dimensional regularization, since boundary terms vanish in low enough 
dimensions. All invariants can be expressed as linear combinations of the propagator 
denominators (there are 5 of each, not counting the square of the external momentum 
p*), so any product of momentum times derivative acting on the integrand will give 
terms killing one denominator and squaring another, except for p? terms, which can 
be canceled by appropriate choice of the momentum multiplying the derivative: 
O k 
pra sr 21 I,,\21 a L,\21 T3279 
3(k + q)?5(k + 9p)*5(k — 5p)?5(a + 3P)?5(9 — ap) 
This operation effectively gives the factor 
0 (q— 5p)” , (atop)? (k+q)? — (k+4)? 


ag (kta) > (D-4) + 


We thus have 


1 1 
Pi = ja ae | ee 
' [a(k + 3p)?)?5(k — 3p)? 3(q + 3P)?3(q — 3p)? 


1 1 
P2 = / dk, ——__—__________ / dq ————.-——_— 
[ak op)" |?a(k = gp)? a(k +.9)?5(q+ 3p)? 
The former term is the product of two 1-loop propagator graphs, the latter is the 
insertion of one 1-loop propagator graph into another. 
Both graphs can be evaluated by repeated application of the generalized massless 
one-loop propagator correction (with arbitrary powers of free propagators) given at 


the end of subsection VIIB4. The result can be expressed as 


eee a 
5P 

['(D — 3)]?I'(5 — D) 
Pe=5) P= ora =) 


in terms of the 1-loop propagator correction Po. We therefore modify our minimal 


Po = cP, C= 


subtraction so that Pp has the simplest form (G scheme): 


Po -_ —4 (3p”)'~* 
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where D = 6 — 2e, and we calculated the coefficient of the 1/e term and threw in a 


normalization factor that canceled the rest: 


ho Nh 


! P05) 
Pa eS 
3D -OTO-DBe-1,2-1) 1 30-9) 
Further evaluating c, we find 
1 1— 2 ['(1 — 2e)]?'(1 + 2e) 


c=-5 


Using the expansion of In I’(1 — z) in terms of y and ¢(n), it is easily checked that 
this combination of I’’s is 1+ O(e?), so we can just drop them. Collecting our results, 
we have 

3 — 2€ i 1 — 2e 


to¢ 3° — Bed — 3e) 


P= Prsais 


Exercise VIIB8.3 
Calculate the same graph in four dimensions. It’s finite there, so no countert- 
erms are necessary. However, in this case integration by parts gives a factor 
of 1/e, and each of the two resulting graphs has an additional factor of 1/e?. 
The result then has a factor of 1 minus the previously obtained combination 
of I’’s, which we already saw was of order ¢?. The final result is thus obvious 


except for a factor of a rational number: 


(The on-shell infrared divergence is as expected from power counting.) 


We next calculate the counterterm graphs. These are the ones that cancel the 
subdivergences coming from the 1-loop 3-point subgraphs. We therefore need the 
divergent part of this subgraph. This is easy to evaluate by our previous methods: 
The result of Schwinger parametrization, scaling, etc., doing all integration exactly 


except over the Feynman parameters is 
1 
dq 5 1 ba inet b— Ip 
50 GP hon) ea = 5p) 


= fa sD ayr(B— Fibs a4)(R-+ pe)? + $a — a) $y)" 


= 34+ finite 
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by simply replacing the factor in brackets by 1 (since it is raised to the —e power), 


where we have used 


[ @asa- De = fda fa 
0 


Since we know the divergence is momentum-independent, we can obtain the same 
result from a (infrared regularized) tadpole graph with its propagator raised to the 
third power: In the notation of subsection VIIB1, 


P(3- 3) 


A, (3,0,m?) = TB) 


i Bis . 
T@) (}m?)P?* = 32+ finite 


The contribution of the 2 counterterm graphs (or one for the effective action if we 


drop the symmetry factor) is thus 
2As3Ppy = 2(—32)Po 


Collecting terms, we have 


1 — 2¢ 1 — 2e 
P+2A2P,) = —it 5° {+4 1,,2\1—2e 11/1,,2\1-e 
300 ~ 72 =— 3° — 3e)(d — 3e) (5D) te ca(5D ) 


After a little algebra, dropping terms that vanish as € — 0, we find 


P + 2AaPo = (3P\lisa — wet — ln gp’)? + gln op” — Fl 
Note that modifying minimal subtraction is equivalent to redefining Sh, which 
we have set to 1, but which appears only in the In’s, as In(4p?) > In(p?/p?). Thus, 
modifying NV, which appears only in the combination (4p?) ~, is the same as shifting 
In 5D": 
N=>Ne* = In gp? >In gp’-a 


For example, choosing a = —, 


Only the O(e) part of the normalization factor affects the final result: More generally, 
(sp°)§ > N(gp’)* => In §p? S In Gp? — tin N 


Since after adding counterterms, which cancel nonlocal divergences arising from subdi- 
vergences, /n’s appear only in finite terms, only the O(e) part of In N will contribute. 


Thus, we can approximate any normalization factor as 


N & e, a= 7+ rational 
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as far as the renormalized results are concerned. (The ¥ identifies this normalization 
as modified minimal subtraction, as the MS or G schemes.) This is just the statement 
of the renormalization group, that the final result in minimal subtraction schemes 


depends only on the choice of scale: The complete normalization factor is really 
Neotal = N(sp")° => In Sh — In She + tin N 


However, the higher-order terms can be convenient for intermediate stages of the 
calculation. In this particular case, the nonlocal divergences appearing before cancel- 
lation are of the form (1/e)In p”, so the O(e?) part of N contributes at intermediate 


stages. For example, replacing the original VV with 
N’ = (1- 2e)(1 — 26) (1 — €) 


would have given the same result even before cancellation, since the change is by 
another combination of I’’s that give 1 + O(e?). 


Exercise VIIB8.4 
Complete the 6D calculation of the exact 2-loop propagator correction in 
@°® theory, including the missing graph and counterterms, to find the total 


renormalized 2-loop propagator and its 2-loop counterterms. 
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saptsaasan tegen eres C. RESUMMATION ....................... 


So far we have only assumed that confinement arises nonperturbatively in 4D 
QCD. However, to connect with known, successful results of perturbation theory, we 
need to understand how the same methods used to give these perturbative results 
can be generalized to include the nonperturbative ones. The simplest method would 
be to take the perturbation expansion as is, and find a good method for evaluating 
(or perhaps redefining) its sum, with the hope that summation to all orders by itself 


would reveal features invisible at finite orders. 


Besides the technical difficulties associated with such an approach, the main prob- 
lem is that the summation of the perturbation expansion does not converge. Parts of 
this problem can be solved by appropriate redefinitions, but other parts indicate a se- 
rious problem with perturbation theory, caused by the very renormalization that was 


supposed to solve the main problem of finite-order perturbation theory (infinities). 


1. Improved perturbation 


We saw in the previous section that dimensional transmutation replaced the di- 
mensionless coupling constant with a mass scale. In principle, we would like to 
explicitly make this replacement as the basis of our perturbation expansion, not 
only to make the perturbative parameter physical, but also to take into account 
the running of the original coupling. Unfortunately, this is not possible in practice; 
however, we can choose the arbitrary (unphysical) renormalization scale py to be in 
the range of energies in the problem at hand, so that the In(p?/j?) corrections are 
small. A change in scale from one value of y to another is related to a resummation 
of graphs: Although the one-loop term in the effective action containing In(p?/,:7) 
comes from a single 1PI amplitude, it contributes an infinite number of terms at dif- 
ferent loop orders to the propagator when inserted into any higher-loop 1PI graph, 
as 1/(k + A) =1/K —-(1/K)A(1/K)+.... Although Kk + A depends only on M, Kk 
depends only on g and A depends only on yp. Thus, any redefinition of that leaves 


the physical quantity M/ unchanged requires a corresponding redefinition of g: 


2 2-1/9? 2/2 1 
M?=peVF = g?(u?) = ——_~ 
In (47) 
and thus changing y redistributes the contributions to 1/( + A) (and therefore to 
the summation of graphs in any amplitude) over the different loop orders. For exam- 
ple, if the amplitude is most sensitive to the momentum in a particular propagator 


independent of loop momenta), and we choose pi? ¥ p”, then although we can’t use 
m 
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the resummed perturbation expansion directly, we can at least push most of it into 


the lower orders. 


Things get more complicated at higher loops: It becomes difficult to associate 
the running of the coupling with the resummation of a particular subset of all the 
graphs. However, we already know that this effect can be derived from the breaking 
of scale invariance by renormalization. For example, let’s consider Yang-Mills theory, 
since gauge invariance restricts it to have only a single coupling parameter. (This 
makes it the simplest case conceptually, although not computationally. Here we use 
only the fact that it has a single coupling; its explicit renormalization constants won’t 
be considered until chapter VIII. As an alternative, we can consider the scalar QCD 
analog of subsections VC9 and VIIB7, a ¢* theory with an auxiliary field, if we ignore 
mass renormalization, or arbitrarily renormalize the mass to zero.) For convenience 
of dimensional analysis, we use only coupling constants that are dimensionless in all 
dimensions, by scaling with an appropriate power of yu. (In general, we can do this 
even for masses.) The classical Yang-Mills action, before and after the addition of 


counterterms, is then 


1 1 
Setass _ <a | ae tr sF?, Polaak +AS = rs) pe tr <F? 
g . g 


Pai) 
where 
1 1 1 aa = 
=o |) 669) |\4 Gola Oy Ge 
9 (gH) F Le ] i) d! 


for some numerical constants c,z. (We can also include h’s as g? — gh.) We use 
$17 to produce the combination ($p?/3?)~¢ in graphs. (In practice, one uses units 
3H” = 1 until the end of the calculation, and restores units.) The ,. dependence is 
then given by varying py for fixed g: 


O O 
2 7a? ae 2 2 m2 
Maps =~ 69 Big), Reo = 0 


where the eg? term is the classical contribution, and g’s independence from pu is the 
statement that the physics is independent of the choice of p (i.e., g depends on only 
M and ¢). is independent of € (except indirectly through g?): By definition, g is 
finite for all D and p, so @ has no 1/e divergences; but also 3 can have no positive 
powers of ¢€, since that would create such contributions in the derivative of g that 
could not be canceled at any finite order in the loop expansion. We then find 


aye =f! 92 a ee ee ce 
= Pe Or GP = B =9 dg? (9 Ci) Ag? (g Cea) — Pagar 
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Thus, the coefficients of the 1/e terms determine those of both the higher order terms 
and 3. 


This gives us an expression for (3, 


B= SG Bin, By = Ley 


L=1 
Since g? is itself unphysical, the information we can get from analyzing the running 
of this coupling is arbitrary up to redefinitions. For example, assume that all @; are 


nonvanishing, and write the definition of G as (in D = 4 > € = 0) 


Then under a redefinition g? — g?(g’”) we have 
01 _ (AQ/g*)\" 
i 2p JV pie i 
a Oi? go? Gar f(g (9 )) i (9 ) 
Now we consider a “perturbative” type of redefinition, as results from changing renor- 


malization prescriptions, so g? gets only “O(h)” corrections: Taylor expanding 


g° = qe ! kg" ame kag’ O(g'°) 
1 1 2 
== 7D + constant + O(g"*) 


we find 
=1+0(9"), — f(g*(g?)) = f(g") + O(9"*) 


=> f(g") =f(97) + O(9") 
Thus, the first two coefficients of @ (3, and 3.) are unaffected, while terms found at 3 


loops and beyond can be modified arbitrarily, and even be set to vanish. In the more 
general case of more than 1 coupling, it is sometimes possible to eliminate also some 


of the 2-loop contributions. 


Therefore, to consider the general behavior of the coupling as a function of energy 


(7), it is sufficient to solve the equation 
O 
20 9 _ 9,4 26 
Mad Big’ — Bog 
(using, e.g., the change of variables t = In pu? and u = 1/9?) as 


= 2 
we = gee i ue Bo ae 
M? gf Bh 
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with M? as the constant of integration. Using an allowed type of redefinition for g?, 


and also redefining the arbitrary constant of integration M?, we can simplify this to 


1 1 6 


ae eee M? — M2e8/6i 
gf gg fh 
ig* ye 1/819? (_,2) 62/8? 
2 Pee a 
al 


(This redefinition changes the range of what g? is called negative and what positive. 
However, g’ is just a parameter, not a physical coupling: As far as the unitarity of 
the kinetic term is concerned, only the residues near the poles of the propagator are 
relevant. Also, our allowed class of redefinitions does not affect behavior for small g’, 


and thus perturbation theory.) 
Exercise VIIC1.1 


Let’s analyze this solution in more detail: 


CE aD 


a Graph the function y(z) = e%*x? (or graph In y to make it simpler) for a 
and b positive, negative, and vanishing, to study the behavior of the function 
u?(g?). The analysis can be simpliifed (and the behavior for different values of 


a and b related) by considering g? positive and negative, and the symmetries 
a—~> —a, v2, a (—1)’y 
I 


i =), a ee y— (-1) 
Sat are 


Note that g? can be nonpositive for some values of y?: For example, even for 
Bo = 0, we have g? = 1/,ln(u?/M?), which is negative for  < M or for 
pp > M. What happens for 6) 4 0? 


b After applying the above redefinition, apply the second redefinition 


Find the new 8 and y?(g”). Compare to the behavior of ?(g?) before this 
redefinition, for the cases 32/3, < 0, noting the “duality” symmetry g 
(—21/(G2)/g. 
Exercise VIIC1.2 
Consider some theory with a single dimensionless coupling g?, but now also 
a single mass m. By the above methods we find 
wo, 


a” = m’[-1- Bm(g*)| 
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(The m dependence follows from dimensional analysis.) Solve for m? as a 
function of g?, as an integral over g? in terms of 3 and 6,,. Show that after 


an appropriate redefinition 


for some constant 31. Solve for m? explicitly in terms of g?, when we have 
also redefined 3 to 3,g4 + Bog°. Then make the final redefinition 1/g? — 
1/9? — 32/G, used to simplify M?. 

For purposes of perturbation theory, it is useful to invert this: For small g?, we 


have approximately 


This implies that the terms in the effective action that carry the I dependence are 


Iv wir fav} if auinse =U + Pn Ge iz) F 


(We can also replace -O) — C in this limit, ignoring iz’s in comparison to In’s.) 


given by 


The general class of coupling redefinitions we considered are allowed by pertur- 
bation theory: If we knew the exact solution to a field theory, we would be more 
restrictive, requiring invertibility. However, in perturbation theory, given two renor- 
malization prescriptions related by some such coupling redefinition, we might know 
this redefinition only perturbatively, and perhaps only to a few orders. Even if we 
knew it exactly, and knew it to be noninvertible, it still might not be clear which of 
the two prescriptions were the correct one, if either. Therefore, the renormalization 
group alone is sufficient to draw conclusions about the behavior of a theory only at 
“small” (< 1) coupling. 

Similar remarks apply to propagators, S-matrix elements, etc. Consider any renor- 
malized function G,, appearing as the coefficient of n fields in a term in the effective 
action. The renormalization of the unrenormalized G,, is taken care of by the com- 
bination of the use of g for the coupling and wave-function renormalization factors 
Z: 


2 
a —f A a) d A 
GilP HP?) = EPH) OGG), a= 
20 ud 
> Ga ; +m) G=0, Vag 


(Z is not required for pure Yang-Mills in the background gauge; or we can examine 
ratios of such quantities where the Z’s cancel, which are more physical, such as S- 


matrix elements.) 
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Unfortunately, this behavior of the theory at high energy leads to problems upon 
resummation of the perturbation expansion. The basic idea of dimensional trans- 
mutation is that the effective action will depend on g and yw only through M. For 
example, in an asymptotically free theory at high energies, propagators (which are 
where this effect shows up) will depend on momentum as a function of only p?/M?. 
More specifically, if we restrict ourselves to just the 1-loop contribution to the @ func- 
tion for simplicity, which will appear in the effective action as a logarithmic correction, 


propagators will depend on 


2 2 
P 1 P 
pan (Sar) = get (Fa) 


(In fact, at one loop in pure Yang-Mills, this is the exact modification of the kinetic 
term.) We now consider analytic continuation of the propagators in this argument: If 
we continue in p”, we know we must find the usual cuts from multiparticle intermediate 
states, at negative p?, extending to p? = —oo. But since the propagators depend on 
g° and p” only through this combination, we must find the same result if we keep p? 
fixed and analytically continue instead in g?: The cuts in g? are thus located at 
= = real + 3,(2n + 1)ai 

for arbitrary integer n, where “real” means starting at some real value and running 
to too. If we plot these in the complex g? plane, we can recognize this equation as 
describing circular arcs running through the origin, with centers on the imaginary 
axis: 

ljg-wPa=r’ ea ee 

2 & ir 

These arcs approach the origin from the positive side (but from either the upper or 
lower plane), with radii 1/6,(2n + 1)27. Normally one would like a small region of 
analyticity about the origin for the perturbation expansion to converge (the nearest 
singularity giving the radius of convergence). Barring that, a wedge of small angle 
about the real axis will do (for a “Borel sum”: see subsection VIIC3). In this case, 
any of these arcs prevent even that. In the following subsections we will examine and 


interpret the causes and effects of this behavior. 


2. Renormalons 


The perturbation expansion in general can’t be resummed in the naive way be- 
cause the number of diagrams increases as n!(constant)" at n loops. The simplest 
example of this is a self-interacting scalar in D=0: 


© de alae «— 
VE ~. 


oo V 20 
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Note that Z = 1 for g? = 0, and Z < 1 for g* > 0, but Z = oo for g? < 0. That’s 
why the perturbation expansion doesn’t converge. It also suggests that for any fixed 


g° the expansion will start to diverge when n is of the order of 1/g?. 


Since there is no momentum integration, each diagram is just 1 (times some 
permutation factors), so Z,, just counts the number of diagrams at n loops. We use 
g’ so the coupling is similar to that in Yang-Mills: As usual, we can rescale ¢ > $/g 


to recognize g? as h: 
P=99 + 50° + 49°" = S56" + 70") 


(Of course, we can be more explicit by writing hg? in place of just g? or h, but the 
effect is identical, since they both appear only in that combination.) This integral 


can be evaluated exactly at any order of perturbation theory: 


oe a 
where we have used the Stirling approximation for I(z) at large z. 


Exercise VIIC2.1 


Find the following properties of the /’ function for large argument: 


a Derive the Stirling approximation 


tim r(2) = yf (2) 


by applying the method of steepest descent to the integral definition of 
I'(z+1). (See subsections VA2 and VA5.) 


b Use this approximation, and lim,z.(1 + +)? = e, to show 


lim P(az +b) & V2n(az)? te V2e-% 


200 


Thus we might as well apply the steepest descent approximation directly to the 


original integral: Using also an integral for h (= g? in this case), 


= fe f0( i) 


we first apply steepest descent to the ¢ integral, yielding the usual first two terms in 
the JWKB expansion. Then the h integral can be approximated as I’(n) by keeping 


only the part of the contour on the positive real axis: 


dh gm 1 1\" 
¢ ante oa (5) 


68/5¢=0 68/5¢=0 


C. RESUMMATION 499 


1 Ca ae ty 
smn Sy feee(28)]"e—m(2) 


5S/56=0 
S40 


(S = 0 solutions contribute only to Zp. A similar result can be obtained by simul- 
taneously using steepest descent for the fh integral, yielding a “classical value” of h 
in terms of S.) This approximation is the same poor approximation as that used for 


—S/h is not Taylor expand- 


the perturbation expansion: ignoring negative g?. In fact e 
able, and the coefficients in its Taylor expansion (from doing the integral exactly) all 


vanish! In the present case, the nontrivial classical solutions are 

S=3¢+7¢ => =H 
which gives the same Z,, as previously (being careful to sum the two terms for the 
two solutions). Thus, we see that in general we have to sum 7, n!(h/S)", which 


does not converge. Furthermore, this divergence is associated with finite-action (“in- 


stanton” ) solutions to the classical equations of motion. 


The simplest example of a resummation problem is the one-loop propagator cor- 
rection. We have seen that the classical and one-loop kinetic terms can be combined 
to give a kinetic operator of the form (3K (p*)In(—p?/M7?) in massless theories, or 
at high energy in massive theories, where K is the classical kinetic operator. The 
free (or asymptotic) theory has solutions where this kinetic operator has a zero (the 
propagator blows up). Besides the classical solution at K(p?) = 0, there is another 
at p? = —M?: 

1 _ 1 _ M? 1 

BLK (p?) In (#7) 6, K(p2) In (1 — ent) ~ BKM) * P+ OP 
This might be expected to be a bound state, called a “renormalon” because of its rela- 
tion to the renormalization group. However, the residue of this pole in the propagator 
can have the wrong sign, indicating the appearance of a ghost (“Landau ghost”), and 


thus a violation of unitarity. 


Exercise VITC2.2 
The Landau ghost itself is not necessarily a problem in quantum field theory, 
although it indicates the possibility of such problems. Examine the behavior 
of this ghost after taking into account the 2-loop correction (32), before and 
after the simplifying redefinition of the previous subsection, for all the various 
signs of 3, and 65. Since the expression for ju?(g”) can’t be inverted, use the 
fact that the propagator follows from the coupling g?(?) as 
A a 
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(Field redefinitions can’t remove the momentum dependence of couplings.) 
Then new poles (or other singularities) in the propagator correspond to the 


limit g? — oo, so find p?(g”) there. 


This causes problems similar to those from instantons when the quantum prop- 
agator is inserted into another graph. We set external momenta to vanish, as an 
approximation for high energy for the loop momenta, or to evaluate low-energy quan- 
tities such as anomalous magnetic moments. In any one-loop 1PI graph with n 1-loop 
propagator insertions and I external lines, we get an integral at high energy of the form 
(e.g., in QCD or the scalar analog with auxiliaries of subsections VC9 and VIIB7) 


2 n oo n 
je (ey 0 In (=) a)" f du ety” wn! (-4)) 


where we changed variables to u = In(k?/y?) (remembering '(n + 1) = n!). We 


have used effectively an infrared cutoff by approximating the u-integral from 0 to oo 
instead of —oo to oo. If we look instead at the low-energy (of the loop momentum) 
behavior, now taking / — 1 massive classical propagators with 1 massless propagator 
(to insure IR convergence) with n insertions, we find 
ak (kh? +m?) "(Wy |-A I ee wae [du et(—wyt =nigr 
| im (5) ~ af u e“(—u)” = nl By 
Since the former comes from UV behavior it’s called a “UV renormalon”, while the 
latter coming from IR behavior is called an “IR renormalon”. The essential difference 
is the relative factor of (—1)". In fact, the former expression is also the high-energy 
limit of the latter (neglecting masses then), so the complete integral (wu from —oo to 
oo, so k? from 0 to oo) can be approximated as the sum of the UV renormalon and 


IR renormalon contributions. 


3. Borel 


Since renormalons and instantons cause the perturbation expansion to diverge by 
a factor of n!, we look for a method to formally sum such series, by relating them to 


series that do converge. In general, we consider the series 


A(h) = >. han 
n=0 
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and define the “Borel transform” as: 


ih= / me aR) si arp) 


ic. 200 
(for some real number r to the right of all singularities of A) in anticipation of 
instanton-like contributions. The inverse is 


A(h) = [ dz e*/" A(z) 


0 
The inverse Borel transform is related to the Laplace transform (with the variable 
change x = 1/h) and the Mellin transform (# = 1/h and y = e*). Evaluating explicitly 


for the above series, 


A(z) = 6(z)ag + SS 2" n+ 


n=0 
So the Borel-transformed sum converges faster by a factor of n!, which is just what 
we need for perturbation theory. The idea for resumming the perturbation expansion 
is to first do the Borel sum, then inverse Borel transform the resulting function. Of 
course, this procedure does not necessarily fix the original problem, which might 
merely be translated into problems of convergence or ambiguity for integration of the 
inverse transform. In particular, we need A(z) to be well defined along the positive 


real axis. 


We saw that generically the sums involved were approximately of the form 
A(h) ~ SA" (n — 1)\(—-k)” 
n=1 


In that case 2 


A(z) ~ $0 (-1)?2"k"*1 = 


n=0 


1 
When k < 0, this leads to a singularity in the integral defining the inverse Borel 
transform. It can be “regularized” by choosing a contour that goes around the pole, 
but the choice of contour is ambiguous, and choosing an arbitrary linear combination 


of the two contours introduces a free parameter. Explicitly, we have 
A(h) = Ao(h) + Ce7 M/F 


where Ap is the result of a particular prescription (e.g., principal value), and ¢ is the 
new parameter. The ¢ term is clearly nonperturbative, since each term in its Taylor 
expansion in f vanishes. This new parameter can be interpreted as a new (nonper- 
turbative) coupling constant in the theory, just like ambiguities in renormalization of 


new counterterms in perturbatively nonrenormalizable theories. 
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Now we more carefully analyze the explicit sums we found in the previous sub- 
section. The first example is VhZ for D=0: 


A(z) _ [- d(1/h) ez/h - do eS/h = = do 


1 1\—1 
as On eg OE = pe ae) 


(The contribution from S = 0 is artificial, coming from our using A = VhZ instead 
of Z.) So, this integral can be explicitly evaluated. (For example, for the action we 
used in the previous subsection, we can explicitly solve for @ at S = z.) However, 
there is then a problem in inverting the Borel transform: Near z = z = S(¢o) for 


classical solutions @9, we have 
S(o) © S(¢0) +35" (¢0)(¢—¢0)? = S'(o) = S"(db0)(@—-¢0), 2-2 © 55" (b0)(¢—$0)? 


= (St 28") (2 — a)? 


Therefore, there are cuts with branchpoints at classical values of the action, leading 
to ambiguities in the result for A(h). We thus see that new coupling constants are 
introduced for each solution to the classical field equation with positive action. (For 
our D=0 example S < 0, and there is no problem, but more realistic examples, like 
Yang-Mills instantons, have S > 0.) 


Exercise VIIC3.1 
Consider the D=0 action 
$= 10-0! 
which differs from our previous example by the sign of the interaction. Now 
we have classical solutions with S > 0. (The interaction is the wrong sign for 
the integral to be well defined, but the “kinetic” term is the right sign for it 
to be defined perturbatively.) Explicitly evaluate $(S) (i.e., @(z)), and show 


it has the above behavior near z = S(dp). 


In the case of renormalons, we see from the previous subsection that the large-n 
behavior gives singularities at z = N/(G, for positive integer N. This integral also is 
easier to evaluate after Borel transforming: We consider a one-loop graph, but replace 
one internal line with the “full” quantum propagator coming from the 1-loop effective 
action (the same as summing a string of 1-loop propagator insertions), while using 
massive propagators for the remaining lines. We thus examine first the transform of 


the quantum propagator 


[ee oe 1 1 (k2\~ 
e SS = SC — — 
2nt k? = +B, In(k?/u?) kk? \ i? 
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by closing the contour on the left. Then inserting this transformed propagator into 


the complete diagram, 


1 1 k2 —Piz m2 —Giz 


using the integrals of subsection VIIB1. This expression is the sum over n of the 
UV/IR renormalon example at the end of the previous subsection, except that we 
have done the summation over n as the first step (and used the Borel transform 
to assist in the evaluation). The first I’ has poles at z = N/G, for positive N, 
representing the IR renormalon, which are relevant for 3, > 0, but the second J’ has 
poles at z = —(N +1 —3)/(, for positive N (and | > 3 for the original diagram to be 
UV convergent), representing the UV renormalon, which are relevant for 3, < 0. To 
the one-loop approximation for the 3-function we have used, the singularities are just 
poles, but if the two-loop propagator insertions are used, these singularities become 


the branchpoints for cuts. 


The new coupling constants that appear nonperturbatively can be given a physical 
interpretation in terms of vacuum values of polynomials of the fields. The basic idea 
is analogous to perturbative tadpoles: In that case corrections to S-matrices due to 
vacuum expectation values of scalar fields can be expressed by propagators that end 


at a “one-point vertex”, whose coefficient is the vacuum value of the field: 


(b(a)) = / D¢ e4(x) = c 


in position space for some constant c, or in momentum space as 


/ Do e°o(p) = c6(p) 


Similarly, we could expect graphs to have two propagators that end at a two-point 
vertex representing the vacuum value of the product of the two fields associated with 
the ends of the two propagators, and so on for higher-point vertices. For example, 


for a ¢? vertex in a scalar theory, it would correspond to a contribution of the form 
/ Do eS $(2) oly) = (24d) +... 
in position space, or in momentum space 
[ 26 Sop ola = (2 + )50)5(0) + 


where c? is the contribution from (¢)?, so c’ represents (A¢)? = (6?) — ()?. Such 


vacuum values do not appear in perturbation theory for higher than one-point; we get 
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only one 6(p) for each connected part of any graph. However, such contributions would 
be expected to give similar contributions to those we have found for renormalons: By 
dimensional transmutation, a contribution to an amplitude of the form ¢e~”/"" must 


appear in the combination 


- - 2\n M2 n 
com one) =< (2) 


This is the type of contribution expected from a propagator with tadpole insertions, 
or in the same way from any other type of vacuum value. In particular, in QCD 
there are no fundamental scalar fields, but only scalar fields can get vacuum values, 
by Lorentz invariance. Thus, the vacuum values come from composite scalars, like 
tr(F?), dq, ete. 


Note that renormalons are a feature of renormalizable theories: They do not 
appear in superrenormalizable or finite theories. In particular, the path-integral 
methods of “constructive quantum field theory” have been used to show that cer- 
tain interacting field theories in lower dimensions can be proven rigorously to exist 


— superrenormalizable theories with unique vacuua. 


4. 1/N expansion 


Perturbation theory is insufficient to evaluate all quantities in quantum physics, 


since 

(1) such expansions don’t always converge; 

(2) if they do converge, they might not converge to the complete result; and 

(3) even if they do give the complete answer, their summation might not be practical. 


There are many perturbative expansions in quantum field theory. When we say 
“perturbation theory” in this context, we generally mean an expansion in the number 
of fields (or, in diagrammatic terms, number of vertices), since in the path integral 
we kept the exact quadratic part of the action but expanded in powers of the inter- 
action terms (cubic and higher). (This is usually also an expansion in the coupling 
constants, depending on how we define the fields, which can be redefined by factors 
of the couplings.) One disadvantage of this expansion is that it violates manifest 
gauge invariance: Nonabelian gauge transformations are nonlinear in the fields, and 
thus mix diagrams with different numbers of fields. (These are the internal fields; 
external fields are asymptotic, and approximated as free.) Graphs that are related by 
gauge transformations must be added together to obtain gauge-invariant, and thus 


physically meaningful, expressions. Also, in practice individual graphs contain “gauge 
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artifacts” that complicate them in certain gauges, but cancel in gauge-invariant ob- 


jects, like S-matrix elements. 


There can be a large number of graphs contributing to a particular physical 
process (given set of external states) at any particular loop order. There is another 
gauge-invariant expansion that can be applied to Yang-Mills theory to subdivide these 
sets of graphs, based on the freedom of choice of the Yang-Mills group itself: We 
have seen that the classical groups are defined in terms of NxN matrices, where N is 
arbitrary. Clearly, S-matrix elements must depend on N, even if the external states are 
restricted to be group singlets or representations of an N-independent subgroup, since 
the number of internal states increases as some polynomial in N. We now examine 


how this can be used to define a perturbation expansion in terms of N. 


Sy 
<> 


We have already seen in subsection VC9 that the group theory of any graph can 
be detached from the momentum and spin (so we considered there a simple model 
of scalars ¢). We also saw there that the group theory of such matrices is most con- 
veniently graphed by a double-line notation, where each line acts group-theoretically 
as a bound (anti)quark, reducing the group theory to trivial Kronecker 6’s. We now 
notice that in some loop graphs, depending on how the lines are connected, some of 
the quark lines form closed loops. Again the group theory is trivial: There is a factor 
of N for each such loop, from the sum over the N colors. We can also give a physical 
picture to these numerical factors: Since we draw the scalar propagator as quark and 
antiquark lines with finite separation, think of the scalar as a (very short) string, 
with a quark at one end and antiquark at the other. This gives a two-dimensional 
structure to the diagram, by associating a surface with the area between the quarks 
and antiquarks (including the area at the vertices). We can extend this picture by 
associating a surface also with the area inside (i.e., on the other side of) each closed 
quark loop. In particular, for any “planar” diagram, i.e., any diagram that can be 
drawn on a sheet of paper without crossing any lines, and with all external lines on 
the outside of the diagram, the entire diagram forms an open sheet without holes, 
and with the topology of a disk (simply connected). It is also clear that, for a fixed 
number of loops and a fixed number of external lines, a planar diagram has the great- 
est number of factors of N, since crossing lines combines quark loops and reduces the 


power of N. 
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handle 
or 


2 faces 


We can be more quantitative about this N dependence, and relate it to the topol- 
ogy of the graph. In subsection VC2 we saw the number of propagators, vertices, and 
loops were related by P— V = L—1. This relation treats a Feynman diagram as 
just a graph, points connected by lines. We now consider a diagram as a polyhedron, 
with propagators as the edges, and closed quark loops as the faces, as defined by our 
use of matrices for fields. We then have as an additional relation for closed surfaces 
“Euler’s theorem”, 

F=P-—V-—2(H—-1) 


(in terms of the “Euler number” y = —2(H — 1) = V — P+ F), where F is the 
number of faces and H is the number of “handles”: 0 for the sphere, 1 for the torus 


(doughnut), etc. This follows from the previous relation: First combining them as 
LP 2 = 


we note that “cutting” any handle along a loop (without separating the pieces) pro- 
duces 2 faces; in other words, introducing two faces (as a “lens” ) into a loop that 
circles a handle changes the surface without changing the diagram, replacing 1 han- 
dle with 2 faces. The last relation then follows from the case with no handles, where 
each face gives a loop, except that the no-loop case corresponds to 1 face (or start 


with a less trivial case, like a cube, if that’s easier to picture and count momenta for). 


Using the fact that the g? appears in Yang-Mills the same way as hf, and that 
each face gets a factor of N, we find the g and N dependence of any graph is 


Gi ee = Ng Ny A 


We thus see that effectively Ng? is the coupling squared suited to planar graphs, 
counting the number of loops, while 1/N? is a new coupling squared, counting the 
number of handles. Therefore, we can sum over both Ng? and 1/N?: Each Feynman 
graph is a particular order in each of these two couplings. The sum of all graphs at 
fixed orders in both couplings gives a gauge-invariant subset of the graphs contribut- 
ing to a particular S-matrix element. (This is sometimes called “color decomposition” . 


Note that g? is the coupling normalized for matrices of the defining representation, 
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which was required here to define the 1/N expansion, while Ng? = $9? is the coupling 
normalized for the adjoint: If we had used matrices for the adjoint representation, a 
factor 1/g% would appear in front of the action, because of the difference in normal- 


ization of the trace of the matrices.) 


Exercise VIIC4.1 
Consider ¢4 theory in D=4, where ¢ is now an NXN hermitian matrix. Gener- 
alize the auxiliary-field propagator correction calculation of subsection VIIB7 
to leading order in 1/N, showing the N-dependence at all steps. Show that 
now, to this leading order, both the N- and g-dependence of the effective 


action can be absorbed into M. 


We can also consider more complicated models, such as chromodynamics, with 
fields appearing in the defining representation of the group, such as quarks. When 
a quark field makes a closed loop, it looks like a planar loop of a gluon, except that 
the closed quark line is missing, along with a corresponding factor of N. Thus, there 
is effectively a “hole” in the surface. Since only one factor of N is missing, a hole 
counts as half a handle. We can also draw a flavor-quark line for the quark propagator 
alongside the color-quark line. Since this line closes in quark-field loops, we also get 


a factor of M (for M flavors) for each quark loop. 
The fact that the 1/N expansion is topological (the power of 1/N is the number 


of holes plus twice the number of handles) closely ties in with the experimental ob- 
servation that hadrons (in this case, mesons) act like strings. Thus, we can expand 
in 1/N as well as in loops. While the leading order in the loop (Ng?) expansion is 
classical (particle) field theory, the leading order in the 1/N expansion is classical 
open-string theory (planar graphs). However, seeing the dynamical string properties 


requires summing to all orders in Ng? for leading order in 1/N. 


Thus, 1/N acts as the string coupling constant. (N appears nowhere else in the 
action describing string states, since they are all color singlets.) The experimental fact 
that the hadronic spectrum and scattering amplitudes follow so closely that of a string 


(more on this later) indicates that the perturbative expansion in 1/N is accurate, i-e., 
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that quantum corrections are “small” in that sense. One application of the smallness 
of 1/N (largeness of N) is the “Okubo-Zweig-lizuka rule”: A planar graph describes 
classical scattering of open strings (mesons). It corresponds topologically to a disk, 
which is a sphere with one hole, and is therefore order 1/N. Compare this to two 
planar graphs connected by a handle. It describes classical scattering of open strings 
with one intermediate closed string (glueball), where the handle is a closed-string 
propagator connecting two otherwise-disconnected classical open-string graphs. It 
corresponds to a cylinder, which is a sphere with two holes, and is therefore order 
1/N?. In terms of flavor lines, the latter graph differs from the former in that. it 
has an intermediate state (the glueball) with no flavor lines. The OZI rule is that 
amplitudes containing an intermediate glueball are always smaller than those with an 
intermediate meson. This rule also has been verified experimentally, giving a further 


justification of the 1/N expansion (though not necessarily of string behavior). 


Generalizing to groups SO(N) and USp(2N) gives more varied topologies: Since 
the left and right sides of propagators are no longer distinguishable, the string surface 
is no longer orientable (the surface no longer has two distinguishable sides), so we 
can also have unorientable surfaces such as Mobius strips and Klein bottles. One can 


also perform a separate expansion in the number M of flavors. 


The fact that the leading (planar) contributions are of order (Ng?)’~1 requires a 


modification of the Borel transform of the previous subsection: We now identify 
ih = Ng? 


instead of just h = g?, so we can use the 1/N expansion in conjunction with the Borel 
transform. In particular, this means removing the factor of N from 3; and absorbing 
it into h. The result is that the position of the renormalon singularities in the z plane 
is independent of N. However, the same is not true for the instantons: A one-instanton 
solution corresponds to choosing a single component of ¢@ nonvanishing in our scalar 
model, so that the classical solution ¢9 for the action S[¢o] has no N-dependence. 
(Choosing ¢ proportional to the identity matrix yields an N-instanton solution.) The 
analog in the Yang-Mills case is using just a (S)U(2) subgroup of the full U(N) to 
define the instanton. (Note that the structure constants for U(N) are N-independent 
for the defining representation: See exercise IB5.2.) Then $/g? = NS/h. The result 
for the positions of the singularities in z is then at integer multiples (positive or 


negative, depending on considerations given in the previous subsection) of z9, where 


1/P1 for renormalons 
A — 
° NS[@o| for instantons 
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where 3, and the one-instanton action S are N-independent. 


The net result is that instantons are unimportant for large N. Thus, if we take 
the 1/N approach of using a resummation to define a string theory, the instantons 
do not take a role in defining the string. (They might return in another form when 
considering classical solutions to the string theory, or their contribution might be 
just a small part of the total nonperturbative contribution.) On the other hand, 
approaches that analyze just the low-energy behavior of a theory can make use of the 
instantons: If the physical value of N is small, or the U(N) theory is spontaneously 
broken to give a small effective N at low energies (as in GUTs), then instantons may 
be treated as the dominant nonperturbative contribution to low-energy effects such 
as chiral symmetry breaking. This can be sufficient for studying low-energy bound 
states, but is insufficient for studying confinement, whose physical definition is the 


existence of bound states of very high energy. 
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Gauge invariance plays an important role in quantum corrections. It not only 
simplifies their form, but leads to new effects. In particular, it not only improves 
high-energy behavior, but can eliminate divergences altogether, in the presence of 


supersymmetry. 


In general, the first thing to calculate in quantum field theory is the effective 
action. Once this has been calculated, other properties can be determined: the 
vacuum, S-matrix, etc. In particular, in spontaneously broken theories, the effective 
action should be calculated with the symmetric (unbroken) vacuum, which has simpler 
Feynman rules; once the effective action has been calculated, vacuum values of the 
fields can be determined, and the S-matrix can be calculated as a perturbation about 
this quantum vacuum. (The alternative of defining Feynman rules for the classical 
broken vacuum and then calculating quantum corrections doubles the work in finding 


vacuum values. ) 


ecese eiacutaueueesats A. PROPAGATORS .......000.0.0.0.0...... 


We first consider propagator corrections in some specific theories with spin. In 
the following calculations we assume the gauge coupling appears only as an overall 
factor in the classical action: It thus also counts loops, so our 1-loop graphs are 
coupling-independent. All the integrals have been performed in subsections VIIB4 
and VIIB6; all that remains is the numerator algebra, which follows the examples 
of subsection VIC4. As we have seen, such corrections are important in analyzing 
high-energy behavior; as we’ll see in the following section, they are also important 
for low energy. (Of course, for massless particles the two are related by conformal 


invariance, even when quantum corrections break it.) 


1. Fermion 


ras 


Our first calculation is the one-loop correction to the electron kinetic operator in 
QED: The S-matrix element is 


“(K+ ob + 75) Ya 
a(k — 5p)?al(k + 5p)? + m?] 


512 VII. GAUGE LOOPS 


At this loop level the only difference between using D-dimensional y-matrix algebra 
(dimensional regularization) and 4-dimensional (dimensional reduction) is an unphys- 
ical finite renormalization, so for simplicity we’ll use the latter method. Then the 
numerator is 


+ 3p — v2m 


The result of the integral is then 
Ave = Pa’ 0,m?) — Ag(0, 0,m)] + (36 — V2m)A2(p?, 0, m”) 
in the notation of subsection VIIB6. The UV divergent part follows from 
A2(p?, 0,m?) = + + finite 


The contribution to I’ is minus the S-matrix element, but the counterterm has a 


second minus sign to cancel the divergence: 
As = tf ac w(—1id — /2m)\v 
The calculation for the quark self-energy in QCD is the same except for group-theory 


factors (see subsection VITA5). 
Exercise VIITA1.1 


Repeat the calculation with D-dimensional y-matrix algebra. What is the 


difference in the finite part, and why doesn’t it matter? 


In subsection VIIB6 we considered MOM subtraction (see subsection VI[A3) for 
scalar propagators. The analysis in this case is similar, but now we expand in p 


instead of p?: 

AK = a+ 0(% — f) + O[(2% —p)| 
However, since AK is normally expressed as functions of p? times 1 and ~, we need 
to translate: Using (5 + p)(5 — p) = 3(p* +m’), 


AK =a +0 (25 —p) + c4(p? +m?) + Ol — B)(P? +m), (P? +m’)? 
=e eee) Ole 2h 


i 


We next reevaluate the fermion propagator correction, to linear order in 5 p. 


Starting with 


_ | : 
Ale, p,m m3) = ja er per \-D/2--E 
a Tk + ip? + mek — ip + my 


E = 552" + ix $8p + 4X1 — B?)p” + $A[(mi + m3) + B(mi — m3)] 
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we keep only linear order in x and p?+m?, and set m,; = m, m2 = 0 (switching back 
toa = $(1+)): 


E & ix-(a—4)pt sAa(1 — a)(p? +m?) + 5Am?d? 


To clearly separate UV divergences (from  * 0) and IR divergences (from a * 0), 


we scale 
aN 


aeere => Ex (a—$)ix- pt gX(4—1)(p? +m’) + 5AM? 


lee) ul 
Aw | dy a ala | da a~*[1 — (a — §)iz - pl[l — $A(4 — 1)(p? + m’)] 
0 0 

The integrals are easily performed in either order: 


. 1 1 
A = P(1+6)(4m?)~* Poet +} _____ is -p 


1 1 ptm 1 1 1 p+m 
1 14 3 —_i : 
++ 3] m? Got: Ho) Pn 


and in the limit € — 0, 


rr 1 
Aw rase(gmy*|— +24 4e-p+ (4 
EUV €IR 


1 2 2 
+ ($41) icp? emi | 
EIR m 


The electron propagator correction to linear order in a5 — pis then 


a i ptm? 
Ao. = T(1 — s+ (4—4+1 
ale Mare ac 
1 1 ptm 
sO 14 y LA 9m 
[424 (G2 +1) So" | ge 2a)} 


rie @\. Coyle eas ol a (ep 
~ : pe EOLA Gig EUV EIR v2 
The 1/eyy terms are the same as the 1/e terms obtained above for minimal subtrac- 
tion. In the MOM scheme, this entire contribution (O(K°) and O(K‘')) is canceled 
by counterterms. 

Exercise VIITA1.2 

Repeat the above calculations replacing the fermion with a scalar. 
Exercise VIITA1.3 


Repeat the above calculations replacing the photon with a (massless) 


| + 

+ 

— 
Skee 
8 
+ 
5, 


a scalar 


b pseudoscalar (with a y_; vertex). 
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2. Photon 


We next calculate the spin-1/2 contribution to the photon (or gluon) self energy: 
The S-matrix element is 
| age al — oP + e)w(k + ob + %)] 
HOE fo)? +m? SR dP + me 
The result of the trace (again using 4-dimensional algebra) is 


—2[kaky — Navg(k” + $p” + m”)] — $(navp* — DaPo) 


The first part is the expression appearing in A,» in subsection VIIB6, once we recog- 
nize its 7, terms as the average of the denominator factors, yielding tadpoles. The 


integral thus gives 


(NasP” — PaP»)(—2A — $Az) & (4 — In p?)(—4)(masp” — PaPv) 


for the divergent and high-energy terms. Using 


A*(—p)(Nasp” — Pap) A’(p) = 5 F”(—p) Fas(p) 


in terms of the linearized field strength F’, the corresponding contributions to the 
unrenormalized one-loop effective action are (including a factor of 4 for identical 
external lines) 


Ty ne [ ae £F(2— In O) Far 


(4 


(neglecting the “—1” part of In(—O)) and the counterterm is thus 


b 


S = 4-2) fu gF Fy 


in the case of QED. For QCD, we must include the group-theory factor tr(G;G;) 


multiplying F'F (Examples will be given in the following subsections.) 


This propagator correction is easier to analyze in the MOM scheme than the 
electron propagator, since there are no internal massless particles, and thus no IR 
divergence to distinguish from the UV one. We therefore just take the explicit ex- 
pressions for the integrals from subsection VIIB6 and Taylor expand in p? about 0 
(or actually in 1/@ of VIIB6.1a, substituting for p? only at the end). The low-energy 
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part of the renormalized effective action for the photon, exhibiting the momentum 


dependence of the coupling, is then 


aft. «5 
even or. y fu aF 2 (= + 25) ee 


where we have applied MOM subtraction by canceling constant (infinite and finite) 


constributions to the coupling. 
Exercise VIITA2.1 
Evaluate this contribution to the unrenormalized effective action to this or- 
der. Show that the constant contributions to the coupling (to be canceled by 
renormalization) are 


asat3le-7-in(gm’) 


3. Gluon 


The most interesting case is the propagator of the Yang-Mills field, in a theory 
of Yang-Mills coupled to lower spins. There is an important simplification in this 
calculation in the background field gauge: Writing the classical Yang-Mills Lagrangian 
as tr F?/g?, the covariant derivative appears as V = 0+7A without coupling constant, 
so the gauge transformation of A is coupling independent, as in general for the matter 
fields. (In terms of a group element g, ¢’ = g¢ and V’ = gVg"".) The effective action 
is gauge invariant, which means the only divergent terms involving the Yang-Mills 
field are the gauge-covariantized kinetic (less mass) terms of the various fields. The 
divergences for the non-gauge fields are not so interesting, since they can be absorbed 
by rescaling those fields (“wave-function renormalization”), but the divergence of 
the tr F?/g? term can be absorbed only by rescaling the coupling g itself. (On the 
other hand, if we use V = 0 +igA, then renormalization of g requires the opposite 
renormalization of A to preserve gauge invariance.) Thus this divergence is related 
to the UV behavior of this coupling (as discussed in subsection VIIB7, and further 
later). The important point is that there is no wave-function renormalization for the 
Yang-Mills field (since there is no corresponding gauge-invariant counterterm), so the 
coupling-constant renormalization (like mass renormalizations) can be found from 
just the propagator correction, while in other gauges one would need also a much 
messier vertex (3-point) correction: BRST invariance is not enough to give the result 


from a single graph. 


516 VII. GAUGE LOOPS 


We now consider the contributions of spins 0 (including ghosts) and 1 (including 
gluon self-interactions), and redo the spin-1/2 contribution in a way that resembles 
the bosons. It is based on the observation that there is a universal form for the gauge- 
covariantized Klein-Gordon equation for spins 0,1/2,1, which can also be shown by 


supersymmetry. The kinetic operator in a background Yang-Mills field is 
K =-4}(0-iF ™S4,) 


where now HO = (V)? is gauge covariantized. This form is true in arbitrary dimensions. 
For spin 0 it is obvious. For spin 1/2, we use the fact that the one-loop contribution 
to the functional integral is the trace of the logarithm of the propagator, as follows 


from Gaussian integration, 
[ve Do e~?®* = det K=e* "K 


where the trace is over all indices, including the coordinates. Then the contribution 
to the effective action from kinetic operator K is 1/2 the contribution from K?. (See 
also exercise VIA4.2.) We then use (see subsection IIIC4) 


29? = -2(y- VP = 73+) VaVe =O + iS Fas 
where we have used 


1/2 
So = —Lye ry, {Ya, Yo} = —TNas 


In the case D=4, this is equivalent to the result obtained in subsection H1C4 in terms 
of just the undotted spinor, but there the 1/2 is automatically included because there 


are half as many fields, so the range of the trace is half as big. 


For spin 1, we use the result of the background-field version of the Fermi-Feynman 


gauge: At quadratic order in the quantum fields, from exercise VIB8.1 we have 
iF? 4+ 4(0- A)? > {2(DiaAy)? + 4 F[Aa, As] } + $(D- A)? 
= —1A-DA—isA Fy, A] = 44: (DO -iF Si) A 


where 0) = (D)? contains only the background gauge field, and in the last step we 


have written the quantum field A as a column vector in the group space and the 
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background fields (like F) as matrices for the adjoint representation (which replaces 


commutators with multiplication), and used the explicit expression 


A 
SY) =|a)yl, (ale) = Nab 


To this order in the quantum fields, the kinetic operator for the two ghosts looks just 
like that for two physical scalars, but gives a contribution to the effective action of 


opposite sign because of statistics. 


This method can be used for arbitrary one-loop graphs with external gluons, 
and easily generalizes to massive fields; we now specialize to propagator corrections. 
There are two kinds of vertices, the spin-0 kind and the vertex with the spin operator. 
Since tr Sjy = 0, we get only graphs with either 2 spin vertices or none. There is 
only one spin graph, with 2 internal free propagators; the 2 spinless graphs include 
such a graph but also a tadpole, which vanishes by dimensional regularization in the 
massless case. Since the spinless graphs give the complete result for internal spin-O, 
their sum is separately gauge invariant; the spin graph is obviously so, since it is 
expressed directly in terms of the field strength. (We refer here to the Abelian part 
of the gauge invariance, which is all you can see from just 2-point graphs.) As far 
as Lorentz index algebra is concerned, we need to evaluate only tr(S,,S-a). For the 
vector, we have 


yeerel 
(7 S.7) = 2Nv[cNd\a 


For spin 1/2, the traces are the same as in D=4 except for overall normalization; 


using earlier identities, or using the same methods for this case directly, 
1/2) a(1/2 
tr(So Sea’) = Gtr Dmoteraja 


where tr(JZ) is the size of the spinor. 
Exercise VIITA3.1 


Let’s look at other ways to interpret the last two identities: 


a Use the double-line notation (subsection VC9) for the defining representation 


of the orthogonal group to derive the above expression for the trace of two 


Ss. 
b Use the fermion action of IIIC4 in terms of just undotted spinors for D=4. 
Evaluate 
fd} atije 
ir( ttf?) 34 


using both bra-ket notation and double-line notation for SL(2,C). Show the 


result is the same as from vector notation (by relating Fi, and fag). 
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All diagrams will also have a group-theory factor of tr(G;G;) ~ 4;;. We'll be 
interested mostly in SU(N) for Yang-Mills theory (as appropriate to describe color in 
the Standard Model for N=3, or arbitrary N for applying the 1/N expansion). Then 
the most interesting representations are the adjoint (for the gluons and their ghosts) 
and the defining (for the quarks). As explained in subsection IB2, or as follows from 


the double-line notation of subsection VC9, we use the normalization 
trp(GiG;) = On => tra(GiG;) = 2N0;; 


Finally, there are the momentum-space integrals, which have already been eval- 
uated in subsection VIIB4 for the massless case (which is sufficient for determining 
the high-energy behavior, and thus the UV divergences) and VIIB6 for the massive 
case. The integral for the spin graph is the same as that for ¢° theory (using the S,» 
vertex from -—40 — siF OHS ee) As labeled there, the external line has momentum p 
and the internal lines k + Sp. Then the vertex factors in the spinless graph with two 
propagators are both simply —k (from —$0) = —30°4+5A-(—i0)+$(-70)- A+ 5A”), 
giving A,», while the addition of the tadpole, with vertex factor —7, converts it to 
Aas: (By comparison, the tadpole graph that was apparently avoided in the Dirac- 
spinor calculation of the previous subsection appeared anyway after evaluating the 
trace algebra.) This contribution also gets an overall tr(/) factor, simply counting 
the number of degrees of freedom. Note that the scalar factor A that appears in A,» 
is the sum of a divergent term proportional to the ¢°? graph and a convergent term 


that vanishes in the massless case. 


We now combine all factors to obtain the contributions to the two-gluon part of 
the unrenormalized 1-loop effective action (including the —1 for getting the effective 
action from the S-matrix, a —1 for internal fermions, either spin $ or ghost, the $ 
for identical external gluon lines, the 5 for the spinor to compensate for squaring the 
propagator, and yet another $ for identical internal lines if the group representation 


was real.) The result is the sum of contributions of the form 
Pag = tr / dx: LF*(4ep)(-1)*[_4<Bi(—D) — 4s°B,(-D) Fis 


where Cpr is the group theory factor from the trace, which for the interesting cases is 


7 { 2 for N@N (defining) 
*™ L2N for adjoint (real) 


This result applies to spins s = 0,4,1, with the understanding that it is the result 


for two polarizations, so there is an implicit extra factor of 5 for a single scalar, 
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while for massive spin 1 (spontaneously broken gauge theories) the third polarization 
in the (background-field) Fermi-Feynman gauge is carried by a scalar field. (The 
result above for s=1 is the sum of the contributions from the vector field and the two 
fermionic ghosts.) The functions 6; and By are the spinless and spin contributions, 
related to the massive ¢° propagator correction A» (p?,m?,m?) as 


2 — 
B,(p”) = A2(p?,m?,m”), B,(p’) = B2(p’) + 1m IE” 
Note that 

B, = Bye +—Inp? 


as far as divergent (at D=4) or high-energy (i.e., massless) terms are concerned. Also 
note that all contributions exactly cancel if all spins are in the adjoint and have the 
same mass, and appear in the ratio 1:4:6 for spins 1 (including ghosts), 3, 0: For the 
massless case, this is N=4 super Yang-Mills, which is also the massless sector of the 
dimensional reduction of the open superstring from D=10. The massless sector of 
the reduction of the open bosonic string from D=26 yields Yang-Mills plus 22 adjoint 
scalars, which cancels near D=4 up to a finite, local term (F*), which can be removed 


by a nonminimal renormalization. 


In examining the contribution of this term to the running of the coupling constant 
with energy, we see that the vectors contribute with opposite sign to lower spins. In 
particular, in terms of the coefficient (; (of subsection VIIC1), only nonabelian vec- 
tors make positive contributions (since Abelian vectors are neutral). This means that 
nonabelian vectors are responsible for any weakening in a coupling at high energies, 
known as “asymptotic freedom”, an important experimental feature of the strong 
interactions (see section VIIIC). Note that while the sign of 3, for ¢* theory, using 
the method of subsection VC9, is independent of the coupling (since all 1-loop correc- 
tions are coupling-independent when the coupling appears as an overall factor in the 
classical action, like h), changing the sign of the coupling changes its sign relative to 
G3,: The result is that this theory can be made asymptotically free only if its potential 
has the wrong sign (negative for large ¢). Thus, although nonabelian vectors are 
required for asymptotic freedom in physical theories, “wrong-sign ¢*” can be used 
as a toy model for studying features associated with asymptotic freedom (especially 


resummation of the perturbation expansion: see section VIIC). 


Note that for (massless) fermions that couple chirally to vectors (as in electroweak 
interactions), cr consists of the contribution from a complex representation but not its 


complex conjugate: Only one of the two Weyl spinors of the Dirac spinor contributes. 
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The result is that the contribution to the vector propagator is half that of the parity- 
invariant case. This fact follows from comparing the calculations of the chiral and 
nonchiral cases without the squared-propagator trick: In Dirac (4-component) spinor 
notation, the y_,’s drop out of the calculation; in Weyl (2-component) spinor notation, 
the left- and right-handed-spinor diagrams are identical except for (internal) group 
theory. (Things are more complicated for higher-point functions, because the group 


theory gives more than just tre(G;G;): See subsection VIIIB3.) 
Exercise VITIA3.2 


Find the conditions for exact cancellation if spins $ and 0 include both adjoint 
and defining representations. Find the weaker conditions if only the divergent 


(and therefore also high-energy) terms cancel. 


Exercise VIIIA3.3 
Use the optical theorem to find the decay rate for a massive vector (e.g., Z 


boson) into massive particle-antiparticle pairs of various spins. 


Exercise VITIA3.4 
Find the propagator correction for internal particles of different masses on 


each of the two lines (e.g., for a W boson propagator). 


In the case of QCD, with color gauge group SU(N.) and Ny flavors of quarks in 
the defining representation of color, the divergent and high-energy contributions to 


this term in the unrenormalized 1-loop effective action are 
I\\,29,QCD ah ur fae aF°L(QNy = 11N-)(2 —In DO) Fru 


At higher loops the effective action will still be gauge invariant in background-field 
gauges (for the quantum fields), so the renormalization of the Yang-Mills coupling 
can still be determined from just the gluon propagator correction. On the other hand, 
in other gauges a three-point vertex must also be calculated: It can be shown that 
the gauge-fixed classical action, including counterterms, is BRST invariant only up 
to wave-function renormalizations; i.e, the most general counterterms needed (with 
a BRST preserving regularization) are BRST-invariant terms with additional mul- 
tiplicative renormalizations of the quantum fields. Thus, BRST invariance, unlike 
gauge invariance, is not strong enough to relate the gluon coupling and wave-function 
renormalizations. Not only does this mean evaluating many more graphs, but graphs 
which make the propagator correction look easy by comparison. (This is not so dif- 
ficult for just the one-loop divergences we have considered, but the difficulty grows 


exponentially with the number of loops.) 
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However, in the background-field gauge the L-loop propagator correction has 
(L — 1)-loop vertex subdivergences, similar to those in other gauges. The net re- 
sult is: (1) We still have a BRST-invariant “classical” action, containing the same 
(counter)terms that appear in other gauges (including quantum ghosts), but covari- 
antized with respect to background gauge fields (and including coupling to other 
background fields). However, the coefficients need be calculated only to order L — 1 
for the L-loop effective action, one loop less than in other gauges. (2) In addition, 
we have background-field-only terms in the classical action whose L-loop coefficients 
do need to be calculated, but with a relatively small amount of additional effort, 
due to gauge invariance. Thus renormalization consists of two steps: (1) adding 
BRST-invariant counterterms for the quantum fields (background covariantized) to 
cancel subdivergences, and (2) adding gauge-invariant counterterms for the back- 
ground fields (which can be interpreted as vacuum renormalization for the quantum 
fields) to cancel superficial divergences. Consequently, background-field gauges save 
about one loop of difficulty as far as renormalization is concerned. Furthermore, 
similar simplifications occur for calculations of finite parts (e.g., effective potentials), 


because of simplifications from gauge invariance. 


4. Grand Unified Theories 


The best result of GUTs is their prediction that the gauge couplings of the Stan- 
dard Model coincide at some high energy, as a consequence of the running of the 
couplings with energy. (Mixed results have been obtained for masses, arguably be- 
cause renormalization group arguments are accurate only for high energies, and thus 
leptons with large masses. A “failed” prediction is proton decay, which has already 
eliminated the nonsupersymmetric SU(5) model with minimal Higgs.) The numerical 
details of this prediction are model dependent (and thus easy to fudge, given enough 
freedom in choice of nonminimal fields), but the fact that all three couplings come 


close together at high energies is already strong evidence in favor of unification. 


Thus we make only the crudest form of this calculation, using only the one-loop 
results of the previous subsection. The main assumption is that there is a “desert” 
between the Standard Model unification scale (around the masses of the intermediate 
vector bosons W and Z) and the Grand Unification scale Mey, with no fundamental 
particles with masses in that range (although, of course, a huge number of hadrons 
appear there). This allows us to crudely approximate all fundamental particles below 


that region (i.e., those of the Standard Model) as massless, and all above as infinitely 
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massive. In particular, in the framework of the minimal SU(5) GUT, this means all 


the fermions are treated as massless. 

Therefore the calculation is to use the one-loop results to calculate the running 
of the couplings in the Standard Model, and use the relation of the gauge couplings 
in the SU(5) GUT to identify those of the Standard Model in terms of that of this 
GUT. From the previous subsection, the running of the couplings is given by 


1 1 _ Bln (“eur ) Bi _ d¢p(—1)*8(4s” i 1) 
a) ary m2 J? ~ > 3 
g°(—p*) 9 p oF 


for two helicities of spin s (with an extra factor of $ for only 1 helicity of spin 0), 
where go = g(M2y7). 

If we use gi, 92,93 to label the couplings of U(1), SU(2), and SU(3) that are 
identified with the single SU(5) gauge coupling at the unification scale, then their 
relation to those of the Standard Model (as normalized in exercise [VB2.1) is 
Ag cos" Ow L 


1 sin? Oy 1 1 
’ 2 2 2 2 a 
2 g € 


93 9? 


where gs and g the usual SU(3) and SU(2) couplings, and the factor of £ is because 
the U(1) generator (see subsection IVB4) satisfies trp(G?) = 2 in terms of SU(5) 
matrices. (We generally normalize to trp(G*) = 1 for each generator. Physical 
couplings are preserved if changes in normalization of generators are accompanied by 


changes in coupling normalization so as to preserve g;G;.) 


Then the values of the (3,’s for the Standard Model are 


fii =0-4-=-f; Pia= 2-4-5=%, Aj3=1l1-4+0=7 


where we have listed the contributions from spins 1, $ (for 3 families), 0, respec- 
tively. (Note that the spinors contribute the same to each because they are all 
effectively massless: They don’t notice the SU(5) breaking. Also, we can ignore 
SU(2)®U(1) breaking when calculating these (’s, since we have neglected the corre- 


sponding masses. ) 


Exercise VITIA4.1 
Calculate the contribution of the spinors to the (,’s, in terms of both SU(5) 
and SU(3)®@SU(2)@U(1) multiplets. (Note the chiral couplings for spinors, 
so for cr a complex representation and its complex conjugate might not both 


contribute. ) 
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The experimental values of the couplings (in the MS prescription) at p= Mz ~ 
91 GeV are i ‘ 
= ~ 804, —sin*Ow %.231, == © 106 
€ 


s 


Unfortunately, taking any two of the equations for 1/g? gives widely varying answers: 


e.£., 


Megur ~ 1G="Gev 


Alternatively, since we have used only two parameters to fit three experimental num- 
bers, we can try to predict the value of any one of e, Oyj, or g, from the rest: e.g., 
from e and g, we can find 

sin?Oy & .207 


which shows the same disagreement (but looks better than the exponentiated error 
for Meur). 


The result is not very accurate, since we have made many approximations, which 
can be improved with some effort: Two-loop corrections add In In terms to the one- 
loop In terms; including the mass dependence of the effective couplings also adds 
significant corrections. But the most important approximation assumption we made 
was the desert: Undiscovered particles, such as new fermions, nonminimal Higgs, or 
supersymmetric partners, change even the one-loop expressions (3. Specifically, since 
by definition the unification scale is where the masses of all unobserved vectors reside, 
these new particles will all have spins 0 or 5, and thus make the 3’s more negative. 


In particular, supersymmetrization yields a result consistent with experiment, with 
Mcour © 2.2 x 10'°GeV 


(This has been interpreted as the only experimental verification of supersymmetry. ) 
Exercise VITIA4.2 
Let’s examine the effects of supersymmetry: 
a Supersymmetrize the Standard Model contributions to @, by adding the su- 
persymmetric partners to each spin: 1 > 1@ 5; ; — ; €0@0,0- 50 0@0 
(where the Higgs scalars have doubled because chiral scalar superfields can’t 


satisfy reality conditions) to find the result 
fia =0-6-3=-8 fio =6-6-1=-1, Ai3=9-6+0=3 


b Solve for 1/g, In(M2y7/MZ) (and thus Meur), and sin?6y in terms of 1/e? 


and 1/g?. Then plug in to find the numerical values. 
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c Show the consistency condition relating the 3 couplings is 


Bie a ths P13 = de Pia 2 cif 
gy 92 93 


and that the closest integer values for the couplings from the above data, 


ween - =aiks = = 106 

aT 93 93 
satisfy it exactly. (OK, so this is just a numerical coincidence, considering 
experimental inaccuracies and theoretical approximations, but isn’t it still 
nice?) Also, note that 


=> sin’Oy = 2 


d Drop the contributions of the Higgs (and its superpartners) to the (’s in both 
the supersymmetric and nonsupersymmetric cases, and reevaluate sin?Oy, 


showing both give the same (poor) value. (Thus, Higgs can make a difference.) 


5. Supermatter 


Although the problem with infrared renormalons may be only technical, the ap- 
pearance of this same problem in several different approaches (including a nonper- 
turbative one; see subsection VIIIB7) strongly suggests that the “correct” approach 
to quantum field theory, in the sense of a practical method for unambiguously (i.e., 
with predictive power) calculating perturbative and nonperturbative effects, might 
be to consider only theories that are perturbatively finite. In this subsection we will 
analyze general properties of supersymmetric field theory using superspace, and in 


particular improved UV behavior, concentrating on finite theories. 


Finite supersymmetric theories must be in particular one-loop finite. This turns 
out to be enough to guarantee finiteness to all loops: Two-loop finiteness is automatic, 
while an appropriate renormalization prescription is required to guarantee finiteness 
is preserved order by order in perturbation theory. (No constraints on the coupling 
constants are needed beyond those found at one loop, but without the renormalization 
prescription infinities cancel between different loop orders.) Of course, wave-function 
renormalizations are gauge dependent: N=1 supersymmetric gauges eliminate some 
of these unphysical divergences (and gauges with higher supersymmetry more), as 
do background-field gauges even in nonsupersymmetric theories. So, “finite theory” 
in general gauges refers only to the “physical” divergences — those that affect the 


high-energy behavior of the theory, namely those that appear in couplings and masses. 
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Because of the nonrenormalization of chiral terms in the action (see subsection 
VIC5), it might seem that the corresponding couplings and masses are always un- 
renormalized. However, the kinetic terms of chiral superfields can receive quantum 
corrections, and the true couplings are defined by field redefinitions that eliminate 
these rescalings. This means that all such renormalizations are related, and given 
by the wave function renormalizations. The only other couplings are the Yang-Mills 
ones, whose renormalization is also given by kinetic terms in background-field gauges. 
Thus, all “physical” renormalizations in supersymmetric theories can be found from 
just propagator corrections. In particular, this means that if the effective action is 
calculated with background-field supergraphs, then it is completely finite in a finite 
theory. 


A possible exception to our statement of all physical renormalizations coming from 
propagator corrections would seem to be the Fayet-Iliopoulos tadpole term f d*6 V. 
However, massless tadpoles vanish in dimensional regularization, and massive ones 
require real representations, which cannot generate explicit-prepotential terms. (In 
particular, at more than one loop such terms never appear in the background-field 


gauge for any representation.) 


42 


The simplest one-loop propagator correction is to ¢¢. (The ¢¢@ correction van- 
ishes, since [ d‘6 ¢? = 0: See subsection VIC5.) There are two graphs to consider, 
one with two internal ¢¢ propagators, and one with internal ¢¢ and VV propagators. 
The d algebra for the two graphs is identical: Both get a d? and a d? inside the loop, 
exactly enough to give a nonvanishing graph (using [d?d?64(@ — 6’)]|g¢ = 1). There 
is also a symmetry factor of 5 for the two $¢ propagators, and a —1 for the mixed 
graph because the two different types of internal propagator have opposite sign (and, 
as usual, an overall —1 to get I’ from the T-matrix). Thus, the supersymmetry (spin) 


part of the algebra is almost trivial in this case. 


Exercise VIIIA5.1 
Use component methods to evaluate the first graph with external fermions: 
the contribution of the Yukawa interaction to the fermion propagator. Show 


it agrees with the supergraph evaluation. 


On the other hand, the internal group theory is slightly messy, so we treat the 


general case immediately: We take vector multiplets V‘ for an arbitrary group (though 


526 VII. GAUGE LOOPS 


we will need a semisimple group for finiteness, since Abelian groups are not even 
asymptotically free). Sums $7, are over each simple subgroup (or each Abelian 
factor), since they can have independent coupling constants grec for representation R 
(especially gc for the adjoint, which we use for the pure super Yang-Mills term for 
definiteness; except for the Abelian factors, where a nontrivial representation should 
be substituted). Similarly, sums }/, are over irreducible representations of the group; 
ITp;7 is the corresponding projection operator. For the simple (or single-component 
Abelian) factors of the group 
nei = Cacll aij 


is used (but again, with a different normalization for the Abelian factors). We also 
use the group theory identities (normalizations) from subsection IB2, now generalized 


to these nonsimple groups and reducible representations: 


C 
Ko De RG 
Gia Gyx" ay = y crclleiy = S ——“Neig 


C 
G G CAG 

CPG Ga) te ay ae 

il jK Ne = RGLLRI = d RI 
R R CAGGR 


Then from the Lagrangian 


—— ‘i d*6 d'(e") "os + ( i 0 tr bb 50K + he.) 


i] 
— y = [es sW*W? nei; 
a FAG 


(ignoring mass terms) the result is simply 


r; * crad - 
Figg =h / d‘6 6' My" Ardy, = Mr’ => 0 gag ~ a ae ee re 
RG GYR 


where again A, is the operator representing the one-loop propagator correction (T- 
matrix) for self-interacting scalars. (Of course, this operator may vary depending on 
the internal masses; here we are concerned mostly with the divergences and leading 
high-energy behavior, which is mass-independent. As usual, we can rescale the gauge 
fields by their couplings in the Lagrangian; this moves these couplings from the propa- 
gators into the vertices, giving the same result for this term in J’, since it has no V’s.) 
Of course, this is the identical group theory that appears in the nonsupersymmetric 
case; we have been more general here because we want to consider exact cancellation, 


while in the nonsupersymmetric case simplicity is usually more important. 
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6. Supergluon 


The supergluon self-energy calculation is similar to the nonsupersymmetric cases 
considered in subsections VITIA2-3. Examining the Feynman rules, we see that those 
for the vector multiplets are similar to the nonsupersymmetric ones for vectors (as 
expected), while those for the scalar multiplets are similar to those for spinors: d? 


and d? are analogous to (in 2x2 matrix notation) 0 and 0*, etc. 


There are now only two kinds of loops to consider, vector and scalar multiplets: 
As for the nonsupersymmetric case, ghosts in background-field gauges couple the 
same as matter, since at one loop the only coupling is to background fields and thus 
covariant, even for ghosts. For the real scalar superfield describing the quantum 
vector multiplet, looking at the terms in the action quadratic in V (from subsection 
VIB10) 


Spy = / de d'0 1V(O1 + 21W°D,, + 21" Ds )V 


we see that vertices have only 1 spinor derivative at most. However, we need at 
least 4 spinor derivatives (2 d’s and 2 d’s) per loop (see subsection VIC5), since the 
result of reducing any loop to a point in @ space always leaves the tadpole 6-integral 
[d...d6*(@ — 0’)]|g-=9, which vanishes for fewer than 4 derivatives. Thus, a V loop in 
a super Yang-Mills background vanishes for fewer than 4 external lines. This means 
the entire contribution of quantum super Yang-Mills to the supergluon propagator 
correction (or 3-point correction from real representations) in the background-field 
gauge comes from the 3 ghosts (including the Nielsen-Kallosh ghost), which couple 
the same as —3 scalar multiplets in the adjoint representation. Thus, for example, 
we see without evaluating a single graph that this correction vanishes for N=4 super 


Yang-Mills, which has also 3 physical adjoint scalar multiplets. (See subsection IVC7.) 


For the scalar multiplets, we can find the analog of the squared-propagator trick: 
The easiest way is by the method of subsection IIIC4, which automatically takes 
care of factors of s, and can be applied classically, without worrying about functional 
determinants. This method requires we consider the massive theory at intermediate 
stages of the calculation, although the mass can be dropped at the end. The only 
resulting limitation is that we must restrict to real representations of the gauge group. 
(In other words, the couplings must preserve parity: For these terms, CP invariance is 
automatic, and reality means C invariance, so P invariance is implied.) However, this 
is a restriction of the usefulness of the squared-propagator trick anyway: Otherwise we 
get expressions like (@ +7A)(@—7iA*) which do not yield useful simplifications. (They 
require as much work as without the trick.) In such cases we are stuck with doing the 


calculations the hard way. This is not just a technical difficulty, it is a consequence 
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of the final result being messier in such cases: For example, for real representations 
there is no possibility of anomalies. However, we can separate the generators into 
the real (scalar) and imaginary (pseudoscalar) ones: Then this trick simplifies the 
real (polar vector) couplings but not the imaginary (axial vector) couplings. (As for 
Pauli-Villars in subsection VIIB2 below, but also for the physical fields before taking 
the mass to vanish after the trick has been applied, the mass term can be chosen to 


preserve the polar symmetries and thus violate the axial ones.) 


However, by comparison of the propagator correction for complex and real rep- 
resentations without (the supersymmetric version of) the squared-propagator trick, 
we see that the only difference between the two is in the (Yang-Mills) group theory. 
Thus, we can calculate for real representations first, using the trick, and then for 
complex representations by simply replacing the group-theory factor in the result for 


the real ones. 


Repeating the procedure of subsection III]C4 with spinors replaced with chiral 
superfields, we begin with the Lagrangian (S = [ dx L) 


t= = [a'oso+ (fae pers [a9 46°) 
where the chiral superfields are covariantly chiral (or background-covariantly chiral) 
Vad = Vad = 0 


Treating @ as auxiliary (the ¢? term has no Yang-Mills coupling, as can be seen, e.g., 


in an “antichiral” representation), we eliminate it by its algebraic field equation 


d= “v6 
After a trivial rescaling 
G3 2nd 


(and using [ d*@ = f[ d?0 VY’) we obtain the action 
Lo =— f a0 3000? — $m2}o =~ f 20 4o( m2 + ilWV2, Valo 


(O = V’V,,) using an identity from subsection VIC5. 


In the chiral vacuum-bubble loop, we no longer have an explicit chiral superfield 
to convert VV? to O +.... However, using the chiral representation V = d?, we can 


write the kinetic operator as 


PV? = PP + (VW? — a) 
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to separate the truly free part from the background interactions. Then quantization 
can be preformed as usual (see subsection VIC5): Essentially, we can now use the 
free —Oh + m? = p? + m? as kinetic operator, since at each vertex there is a d? to 
project back to chiral superfields. Of course, in general we need only one projector in 
any trace over a subspace: In this case that result is obtained by integrating the d?’s 
by parts in the loop back and forth across the free propagators, since sandwiching 


any V? — d? between them produces 
d?(V? —d?)d? = 4(O — Op + i[W%, Va]? 


Repeating the procedure till only one d? is left, the Feynman rules for this loop become 


1 
ropagator: ——-——6*(@ — 6 
propag iP +m) ( ) 
one vertex : d?(V? — d’) 


other vertices : 5(O — Oo + 3[W°, Va]) 


We thus see that one vertex has at most 3 derivatives (d?d) while the other has 
at most 1 (d): 
d?(V? — d’) = @{iACd, + Zi(d° Aq) — SA%Aa| 
5(O—Oo +7[W%, Val) = iW da + 3i(d°Wa) — $[W%, Ao] Hi A%0q + 4i(0%Aq) — $A%AG 
exactly the minimum needed. (Thus, there are insufficient derivatives for a tadpole 
contribution to the propagator.) The result for this diagram is then the same as the 
corresponding diagram in bosonic y® theory, with a group theory factor tr(G;G;,), 


and replacing y(—p)y(p) with 


/ d'6 d*0' (iW'*(—p, 0’) dl,54(0 — 6") [iA29(p, 0) d2dg6*(0 — 6')] 


= i ‘9 4W'?(—p, 0) Ad (p, 0) = fi a0 3W'*(—p, 0)Wi(p, 8) 


using dd = —d'6, integration by parts, [d?d?d+(0—6’)]|g-—9 = 1, and W, = d? Ag (chiral 
representation). Written in the notation of subsection VIIIA3, the 2-supergluon part 


of the unrenormalized 1-loop effective action is then 
Paey =—h tr faa d’6 LW(4cr) A.W, 


for a scalar multiplet, and exactly —3 times that for a vector multiplet, including the 
massive case. Thus, cancellations again survive the introduction of masses. Also, if 


the masses of the various scalar multiplets are equal the entire propagator correction 
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is canceled in such theories, while for unequal masses only the divergence, and the 


corresponding leading (logarithmic) high-energy term, is canceled. 


Exercise VIITA6.1 
Show this result agrees with the restriction to N=1 supersymmetric theories 


of the component result of subsection VIITA3. 


Exercise VITIA6.2 
Take the result of subsection VIJA3 literally for all spins s (arbitrarily large). 
Using the fact that multiplets with N+1 supersymmetries can be written as 
2 multiplets with N supersymmetries, differing in maximum helicity by 1/2, 
recursively find the result for general s (now labeling maximum helicity) for 
all values of N> 1, and show it vanishes for N> 3. 


Exercise VIITA6.3 
Calculate the chiral scalar contribution to the one-loop supergluon propagator 
correction without the squared-propagator trick. (Hint: There are 8 spinor 
derivatives in the loop. Integrating them by parts off one propagator produces 
3 terms, since the number of d’s and d’s inside must be equal, because what’s 
left is always spacetime derivatives on [d?d?5*(6 — 6’)]|g:—9.) 

Generalizing the group theory as in the previous subsection, we have the total 


result 
Kawah f dete SW" Me)nasAWi, — Ma =~ 
G 


(For Abelian factors, irrelevant for finiteness, we should take the c4g factor out of 
Noij and put it into Mg; then cag = 0 for Abelian groups, so Mg > So pcre > 0.) 
Therefore, combining with the results of the previous subsection, the conditions for 


finiteness are 


CRG » Credag,, J ix JKL 
y — =3, y Gae—__q_ tri = 5XIKLA 
Pan cacdr 


Cc 
R AG 


In particular, for the case of N=4 super Yang-Mills written in terms of N=1 superfields 
(see subsection IVC7), we have 3 adjoint chiral scalars ¢; with I = il’, where 7 is 
the adjoint label and J’ = 1, 2,3 (which appeared as the label J in subsection IVC7, 


where the adjoint label was implicit in matrix notation). Then 


IJK igk I I'K! 
Aw” = gaf?*e 
and the above two finiteness conditions reduce to 


J J’ 1 J'K'L!' J jkl 
On = 3, On = FEI K'LIE (0; = Fat? ) 
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As explained in the previous subsection, in the general case the finiteness conditions 
may receive quantum corrections at 3 loops and beyond, depending on the model and 


renormalization prescription, but no new conditions are added. 


Presently there is no deep understanding for the finiteness of these models (at 
least, not deep enough to always avoid the quantum corrections to the finiteness con- 
ditions). Note that they are finite for arbitrary values of the couplings, up to the 
two above restrictions: For example, we can scale all the couplings by a common 
factor. Thus, they are finite order-by-order in perturbation theory (loops). Non- 
supersymmetric theories can also be finite, but only for specific numerical values of 
the coupling, i.e., not for arbitrarily small values of the coupling, and thus not order- 
by-order in the loop expansion; they therefore suffer from the renormalon problem. 
(The renormalon-like behavior of instantons is not a problem in the framework of the 
1/N, expansion.) The finiteness of theories with extended supersymmetry has been 
explained by various arguments (in particular, for N=2 there are no divergences be- 
yond 1 loop even for theories that are just renormalizable), but none of these applies 


to the general case of simple supersymmetry. 


To obtain more realistic models, we may want to consider adding “soft” super- 
symmetry breaking terms (those which have little effect on high-energy behavior), as 
introduced in subsection IVC6, to these finite theories. Finiteness can be maintained, 
but the conditions become considerably more complicated in the general case. Note 
that spontaneous breaking of supersymmetry is not allowed, because the first condi- 
tion prohibits U(1) factors (with c4g = 0; thus no [ d*@ V terms), while the second 
prohibits gauge-singlet matter (with crg = 0; thus no [ d?6 ¢ terms). 


7. Schwinger model 


The simplest interacting model in D=2 is the “Schwinger model”, massless QED. 
This theory is even simpler than scalar theories because its interactions occur only 
through a massless gauge vector, which has no physical polarizations in two dimen- 
sions (D—2=0). 

The most interesting feature of the Schwinger model is that all amplitudes with 
external vectors can be calculated exactly. In fact, the only nonvanishing 1PI vector 
amplitude is the one-loop propagator correction, which gives just a mass term. In 
that sense the theory is trivial, and describes just a massive vector. However, the 
methods of calculation are instructive. We first consider some simple methods of 
calculation of just the propagator correction, and then show that it is the only 1PI 


vector graph. One method we have already considered is dimensional regularization; 
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from subsection VII[A2-3 we have the contribution to the effective action (correcting 
for the 2D normalization tr(/) = 2) 


1 
n= | de PSP 


where we write Fy, = €gF in D=2. Although this calculation needs no renormal- 
ization, regularization is still necessary to allow naive manipulation of the integrand: 
Using dimensional regularization, we see from the result of subsection VIITA3 that 
we get a factor of aa —4s? ~ ein D=2+4 2c, canceling the 1/e pole from the scalar 


integral. 


It can also be calculated in position space, using the methods of the previous 


subsection. The Lagrangian in lightcone notation is 


L=—gZaF” + |We(—idg6 + Aco)e + Yo(—tOne + Ao) al 


We can calculate separately the contributions of we and We to fermion loops. The 
“photon” propagator correction consists of the product of two fermion propagators, 
as given in the previous subsection. We then find for the effective action (including 
another —1 for T — I anda 5 for identical external lines), after including a finite 


counterterm to restore gauge invariance, 


1 1 1 
aA (81)" 5A ++ 2A.(8-)" aA —= A, A_ = Par 


(after integration by parts). 

This same calculation also gives the “axial anomaly”: Consider an axial vector 
gauge field B that couples to the current +a. (not summed), in addition to A’s 
coupling to wate. (In D=2, W, = «°VYs => We = +V2.) The contribution to 


the 1-loop effective action with one of each vector externally is, after including a 


counterterm to preserve A gauge invariance (and therefore break B gauge invariance), 


The anomaly is the breaking of B gauge invariance, 

or 
B — es T = . —— =? F 
C) OX => 6 / AO 5B / 


An anomaly is by definition a quantum effect: As we have seen from the 2D 


axial anomaly, it is related to a divergence that violates naive classical arguments, 
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since the regulator itself violates the symmetry. In the axial case there is no actual 
divergent term in the effective action, but a finite term results from a ¢/e type of 
cancellation. Dimensional analysis immediately reveals that the propagator correction 
is the only graph in D=2 that can contribute such a term from the fermion loop. 
(Fermion propagators go as 1/p, while the vertex is a constant: The electric charge 
has dimension in D4.) 


The complete one-loop effective action for the vectors then follows directly from 
the complete anomaly for the axial current, and the vanishing of the anomaly for the 


polar current: By separating out the anomalous term in the effective action, 


~ 6A’ 6A 


r= f Par+ar: po! Age 


Ogd=0, 0% d=] —2F = oO-(Al)=o0 x (Ad) =0: =] Ava) =]. Ar =0 


(up to an irrelevant constant), where 0 x J = €”0, Jy is the curl of the polar current, 
but also the divergence of the axial current. (There are some questions of boundary 
conditions in solving the divergence- and curl-free conditions as AJ = 0, but these 


are resolved by working in Euclidean momentum space.) 


Similar remarks apply to external gravity: From a similar calculation, replacing 
the vector current with the energy-momentum tensor, we find 


1 
An T™ = 0, One™nT? PMR => Tn R, Dw ROR 


where R is the 2D curvature (which is just a scalar, as the vector field strength is 
a pseudoscalar). While in the vector case the finite local counterterm was chosen to 
preserve polar gauge invariance and thus violate axial, for the tensor case a term is 
chosen to preserve local conservation of energy-momentum and thus violate conformal 
invariance T™,,, = 0. (The above expressions are linearized, but the results can be 


generalized to fully nonlinear gravity.) 


Exercise VITIA7.1 
Calculate the gravitational anomalies from a massless spinor loop in D=2, 


using the classical expressions (as follow from dimensional and Lorentz anal- 


ysis) 
= te(—31)O+ve, — Po(—31)0_de, i= 0 


(If you work in terms of I’ you can define the perturbative field h,, such that 
6 jiha=T™.) 
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The simple form of the effective actions in the Schwinger model is a consequence 
of bosonization: Thus, including coupling to electromagnetism and gravity, the action 


for the massless spinor is equivalent to 
L=—7¢0¢ + (F + R)¢ 


Integrating out the scalar generates the above effective actions classically. 


Exercise VIIITA7.2 


The above action is dual to the mass term of the Stiickelberg action: 


a Consider the first-order Lagrangian 
L = —G? + G*(mA, + 0.4) 


Eliminating the auxiliary field G, by its field equation yields the usual mass 
term for the Stiickelberg model. Show that if we vary ¢ instead and solve 
the resulting constraint on G, we obtain (the nongravitational part of) the 


previous action. 


b Generalize this construction to D=4, where the field dual to the Stiickelberg 


scalar is now an antisymmetric tensor gauge field. (See exercise IIB2.1.) 
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In general, the only loop corrections that can be evaluated exactly in terms of el- 
ementary functions are the one-loop propagator corrections. However, limiting forms 
of vertex corrections, for various low- or high-energy limits, explicitly yield the most 


important pieces for certain applications. 


1. JWKB 


Some low-energy contributions to the effective action can be obtained by vari- 
ous quantum mechanical JWKB approximations. This involves an expansion of the 
external field about its vacuum value in spacetime derivatives (momenta). Such an 
expansion makes sense if this field is massless, since then small spatial momentum 
means also small energy, in the relativistic sense. (Otherwise one needs to expand 
nonrelativistically, about p= 0 but E =m. Such treatments were considered in sub- 
section IIB5, and will be applied to loops in subsection VIIB6.) It also can be useful 
when the mass of the external field is small compared to the mass scale relevant to 
the interactions, such as for chiral symmetry breaking in the low-energy description 
of light mesons (subsection IVA4). 


On the other hand, the fields we are integrating out must be massive, with a mass 
greater than the energy we want to investigate: Otherwise, the internal particles 
would show up as poles (and cuts) in the amplitudes, where Taylor expansion in 
momenta would be a poor approximation. ‘The basic principle for analyzing the 
behavior of such a theory in a certain energy range is thus to first find contributions 
to the effective action where: (1) only particles with masses of lower energy appear 
on external (background) lines, and (2) only particles with masses of higher energy 
appear on internal (quantum) lines. These contributions are approximated by Taylor 
expansion to finite order in external momenta, yielding a local effective action. We 
could then consider finishing the functional integration by integrating out the lighter 
particles on internal lines: However, in this approximation it would be inaccurate to 
consider such particles in loops, since there they would include energies above the 
approximation scale. Thus, the effective action obtained by integrating out just the 
heavier fields is useful only when the lighter fields are treated classically. We apply 
the same approximation scheme to the classical action: Eliminate the heavier fields by 
their classical equations of motion, and Taylor expand their propagators in momenta 


to the desired order to get a local result. 


538 VII. GAUGE LOOPS 


In subsection VIIB2 we saw the simplest example, the effective potential: In that 
case the constant background scalar field acted as just a correction to the mass. We 
now consider more complicated cases, where spin and gauge invariance play roles for 
the internal or external fields. In particular, adding coordinate dependence to the 
background fields means we need to consider more general propagators for quadratic 


kinetic operators, such as harmonic oscillators. 


We saw in subsection VIB1 the most general relativistic particle action for a scalar 
in external fields that was quadratic in 2 and #. We now consider such actions in more 
detail: They are the most general ones for which we can derive one-loop results to all 
orders in the external fields (i.e., without performing the JWKB expansion beyond the 
first quantum correction, which requires Taylor expanding the exponential in terms 


that are beyond quadratic, thus expanding in the number of external fields). 


Without loss of generality, we can consider Lagrangians that are homogeneous 
of second order in x and #@: Terms linear in ¢ are boundary (in 7) terms, and were 
already eliminated by a gauge transformation (radial gauge). Terms linear in x can 
be removed by a translation, in the presence of an x? term (which is needed to bound 
an x term in the potential). (Both these kinds of terms can be restored trivially at 
the end.) A constant term is also trivial, giving a contribution to the classical action 
that is just that times T (after integration cig dr), and can be treated separately. (It 
doesn’t contribute to the equations of motion.) The remaining contribution to the 


mechanics action is then of the form (as usual, in the gauge v = 1) 


T 
s= f dr 5[-d° +2At+cBr] => #%+At+Br=0 
0 


vi 
= f= | dr 3(—4? — x#) = —4(at)|5 
0 


where A is an antisymmetric matrix and B symmetric. The steps to this contribution 


to the one-loop effective field action are then: 


(1) Solve the equations of motion, which are homogeneous second-order differential 
equations. 


(2) Change variables from the two parameters used for each x to x(0) and 2(T). 
(Second-order differential equations require two initial conditions, or one initial 


and one final.) 


(3) Find $(2(0),2(T)), including separately the contribution from the constant term 


in the Lagrangian. 


(4) Find the propagator for “time” 7, including the e~** and the van Vleck determi- 
nant. (See exercise VA2.4.) 
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(5) Integrate the propagator over T to find I’. (See subsection VIIB2.) 


For example, consider in QED the contribution to I’ from a fermion loop. If we 
are interested in only the properties of photons, then this gives the entire contribu- 
tion to the functional integral from integrating out the fermions: This contribution, 
plus the classical (free) Maxwell action, gives a nonlocal “classical” action of self- 
interacting photons, which can itself be quantized to give the exact QED result for 
external photons. Although this one-loop effective action is too difficult to calculate 
exactly, the first-quantized JWKB approximation can give an accurate description 
at energies small compared to the electron mass. Note that we are simultaneously 
approximating to the first quantum correction in JWKB expansions of both the field 


(second-quantized) type (one-loop) and the mechanics (first-quantized) type. 


The mechanics action for a massive particle in a constant external electromagnetic 
field strength (the lowest nontrivial order, but also the highest that keeps the action 
quadratic), in the radial gauge for the background field and affine parametrization of 


the worldline, is (see subsection VIB1) 
= fe 5(-2? + 2° Ft? + M") 
To include spin, we identify (see subsection VIIA3) 
M* =n —iS"F, 


Since the only appearance of spin operators in the calculation of the propagator 
(denominator) is this constant matrix, it commutes with everything, so we can treat 


it as a number till the last step. The equation of motion 
¥+F£=0 


is easily solved in matrix notation. (Hint: Solve for # first.) Finding x; = x(0) and 
xf = x(T) in terms of our integration parameters and inverting, then expressing <(0) 
and £(7’) in terms of x; and wy (and T and F’), and making use of the antisymmetry 
of F’, the result is 


S = —t(ay5 — 24) F coth(42) (x; —2;) — 4a Fx; + 4M’°T 


L Fr 
4 2 


The propagator is then given by (see subsections VA2 and VIIB2) 


O?(-iS) is 


~iTH 
x fle x;) = ,/det ————e 
(rl lei) Ox OX; 
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Plugging in, and then Wick rotating T — —77’, we find for the propagator with ends 
tied together 


iP 
(rle~7# |x) — det ioe ~M?°T/2 


Finally, the contribution to the effective action is (see subsection VIIB2) 


r= ~e | de [F nd (ut FF goin)? 4 | de penn) 


- LET 
= -c f av f gr Teste wee | det <I aC nana oe tn) 
0 


_ e tFT 


where c = —3 for fermions, for statistics and squaring the propagator. (The “det” is 
for the vector indices on F,,”, the “tr” is for the spin indices from powers of S@ in 
eS P F or) .) 

Exercise VIIIB1.1 


Explicitly evaluate the determinant and trace for D=2. 


Exercise VIIIB1.2 
Expand I" in F and show the resulting F? terms agree with those obtained 
in subsection VITIA2-3. 


Exercise VIIIB1.3 


Consider the quadratic action 
f= fe 3(—4? + x(a — a") — xaa'z] 


where the matrix a commutes with its transpose ({a,a’] = 0). Solve the field 


equations for S(x;,x7;T). Find (z\e~7" |z). 


2. Axial anomaly 


The axial anomaly comes from a finite graph, as we have already seen in subsection 
VIIA7 for the case D=2. However, the naive manipulations that would show the 
graph to preserve gauge invariance involve evaluating the finite difference between 
divergent graphs, each of which needs regularization. Although in some cases the 
graph can be evaluated explicitly, and then shown to be anomalous, it is generally 


easier, and more instructive, to analyze the anomaly by itself. 


The axial anomaly is associated with the use of € tensors. In renormalizable 
theories in D=4, these occur only through y_,’s for spinors. (In nonrenormalizable 


theories, or in D=2, € tensors can occur in scalar theories. There is also the term 
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ced i. Fg, which is a total divergence, and has no effect in perturbation theory.) 
In general even dimensions, the massless kinetic term for a spinor is invariant under 
transformations generated by y_;, but the mass term is not. Chiral symmetry is thus 
related to masslessness; this is also true for conformal invariance, so it’s not surpris- 
ing that quantum corrections can break both. (In fact, in supersymmetric theories 
conformal symmetry is related to a particular chiral symmetry by supersymmetry, so 


breaking of one requires breaking of the other if supersymmetry is to be preserved.) 


Dimensional regularization manifestly preserves neither conformal nor chiral in- 
variance; no regularization does. The existence of these anomalies proves the impos- 
sibility of such a regularization. Furthermore, dimensional reduction has difficulty 
dealing with y_,; it even has inconsistencies in the presence of axial anomalies. On 
the other hand, Pauli-Villars regularization is especially convenient for dealing with 
axial anomalies because it regularizes by introducing masses. Thus, it breaks chiral 
symmetry explicitly but softly, conveniently parametrizing the breaking by mass pa- 
rameters. We therefore will use Pauli-Villars regularization for the single purpose of 


evaluating the axial anomaly. 


The basic idea of Pauli-Villars regularization is to include massive “ghost” fields 
which would cancel graphs from physical fields if they had the same mass. But the 
masses of the ghosts are used as regulators; after subtracting local divergences, the 
regulator mass is taken to infinity. In our case, as we'll see by explicit evaluation, the 


anomaly itself is finite, so no subtraction is necessary. 


The graph whose anomaly we want to evaluate is a one-loop 1PI graph with 
external vectors and a massless internal spinor. Of the vectors, all but one is a 
“polar” vector, coupling to wy_y, while the last is an “axial” vector, coupling to 
wWy_-1Yat?. These are the currents associated with the symmetries 7’ = e’%y and 
wy’ = e-1y (72, = —5). We add to this graph a similar one, but with a massive 
spinor, and give the second graph an overall relative minus sign. Since the mass 
breaks chiral invariance, we have explicitly broken the gauge invariance of the axial 
vector, while preserving those of the polar vectors. Note that this is a feature of the 
regularization: If a regularization existed that preserved chiral symmetry, then we 


could freely move the y_; around the graph from one vertex to the next using the 
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usual naive anticommutation relations, thus moving also the anomaly from one vertex 


to the next (i.e., violating gauge invariance in any vector we choose). 


Gauge invariance is represented by vanishing divergence of the corresponding 
current: At each vertex we have the coupling { A-.J, with gauge invariance 6A = OA, 
implying 0-J = 0 by integration by parts, where J may be polar or axial depending on 
the vertex. These currents are conserved classically. We know they are also conserved 
quantum mechanically in the absence of y_1’s, since dimensional regularization and 
renormalization preserve the gauge invariance of the effective action. In graphical 
terms, taking the divergence at a vertex kills a propagator (since 0- J is proportional 
to the field equations of the internal field), and this can be shown to lead to vanishing 


of the graph. 


However, with the Pauli-Villars regulator, the classical conservation of the axial 
current is explicitly broken. The result is that the complete axial anomaly can be 
found by looking at just the contribution coming from this explicit classical violation 
of current conservation (inserted into the one-loop graph). (The classically vanish- 
ing contributions are actually nonvanishing because of the anomaly, but they cancel 
between the physical and regulator fields, precisely because the regularization allows 


the naive manipulations that justify dropping them.) 
We therefore want to evaluate the anomaly 


r 
ae 


Ons (2) 6A,(x) 


where we start with a term in the classical action [ A- J, so classically J = 6S/6A, 
and then evaluate its quantum correction by looking at 7 = 6I°/dA in terms of the 
one-loop part of the effective action I’. Classically, we find a contribution from only 
the regulator, 


0. J =0- (-V2idby_ayyp) = 2p 


So, all we need to evaluate is a one-loop diagram with the axial vector coupling to the 
regulator replaced with a pseudoscalar coupling [ dyy_1v, and look at the graphs 
with one external pseudoscalar and the rest polar vectors. Clearly this is the same 
as coupling the pseudoscalar to the propagator of a bosonic spinor regulator in an 


external vector field: 


1 
0-J=2mt a 
wa mr 7? + ine 


Vie (= oe a-iv i ) 
a =) . 


1 
_ 2 
= J/2m tr (aaa za a) 
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where the trace is in the y-matrix space. In the limit m — oo, graphs with more 
external lines vanish more rapidly. On the other hand, we need at least D/2 fac- 
tors of S° (D y-matrices) to give a nonvanishing y-matrix trace. Thus, the leading 
contribution will be, using -2V? = 01+ iS“F,, from subsection IIIC4, 


0-J =—V2m? tr V-15 
2 


1 - cab if - cab 
Gey “age 


(m? — Lo) 3(m? — Lo) 
with D/2+1 propagators, where Oy = (0,)?. 


Thus, the only Feynman diagram we actually need to evaluate is the one-loop 1PI 
diagram with external and internal scalars. The limit internal m — oo is the same 
as the limit external p — 0. (The result does not depend on the internal momentum, 
which is integrated over, nor the external mass, which would appear only in external 
propagators.) Thus, this is just an effective potential calculation. We therefore have 
the integral (see subsection VIIB1) 


fu 1 _ i] 
[ERP + m2) TB) gm 


2/2 
= ~~. f= a tr[y_1(LiS” 3)? 
5)! 


Exercise VIIIB2.1 
Check this result by using the expression from subsection VIIIB1 for the 


propagator in a constant external electromagnetic field (strength). 


To evaluate in arbitrary even D, we note that the normalization of y_; is such 


that we can choose 
(aE = (APPZ GG1oyP 
if(l) = gh. Getta = tr[y_1(44S” Fup) ?/?] = alg) Fay... Fed 


ab---cd 
E Fup..-Fea 


> 0-fJ =2———__ 

22/2(2)| 

Thus, for example, for the Schwinger model (D=2) we have 
O-J=-2F 

in agreement with subsection VIIIA7, while for D=4 


O ff = 56 Bay Fed 


544 VII. GAUGE LOOPS 


3. Anomaly cancellation 


When the anomaly occurs in a current that couples to a gauge field, unitarity 
is destroyed, since gauge invariance implies current conservation. This is a poten- 
tial problem, since axial vector couplings occur in the Standard Model. (Actually, 
they are “V—A”: (vector)—(axial vector).) The only way to avoid this problem is 
to have an anomaly cancellation between the different spinors: The coefficient of 
the anomaly is given purely by group theory, as tr(A{B,C}), where A,B,C are 
the matrices representing the couplings of the three vectors to all spinors, and the 
anticommutator comes from Bose symmetrization (from the crossed and uncrossed 
graph in the S-matrix, or the single contribution multiplying commuting fields in the 
effective action). We therefore require this trace (which represents the sum over all 
spinors) to vanish. (See exercises IB5.3 and VC9.2d for an example of the calculation 
of this trace.) The representations in the Standard Model have been chosen so this 


cancellation occurs in each family. 


We already know in terms of Dirac notation that axial anomalies appear only 
in the presence of y_;’s. An absence of y_;’s is equivalent in terms of Weyl no- 
tation to the use of a (pseudo)real representation for undotted Weyl spinors. For 
example, consider a real representation that is reducible to a smaller (by half) rep- 
resentation “R” and its complex conjugate “R”: Then we can complex conjugate 
the complex-conjugate representation to produce a dotted Weyl spinor that is the 
same representation as the undotted spinor. The undotted and dotted spinor can 
then be combined, as usual, to form a Dirac spinor, which transforms as the complex 
representation, without y_,’s, and thus the same goes for the coupling of the gauge 


vector: 
URa @VRa 7 VRa O VRa > YR 


So, in Dirac notation we can see that such representations do not contribute to anoma- 
lies because of the absence of y_;’s. Similar remarks apply to general real or pseudo- 
real representations: We can take an arbitrary (pseudo)real representation and make 


a Majorana spinor, as 
VRa > VRa B VRa — YR 


where now wpa is simply the complex conjugate of rq since R = R. 


This cancellation also can be seen directly in terms of Weyl spinors: The (pseudo)- 
reality of the representation is charge conjugation invariance (which is equivalent to 
parity invariance for spin-1 couplings to spinors, since such couplings are always CP 


invariant). Anomaly cancellation is then a generalization of Furry’s theorem (see 
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subsection VIIA5). Real and pseudoreal representations of the generators (including 
complex + complex conjugate) are antisymmetric, up to a unitary transformation, 
since they are hermitian: 


GT = G* = -UGU™ 


(so dW = iGw preserves reality or pseudoreality). Thus 
tr(A{B, C}) =tr(—A7{-—B?, —C™}) = -tr(A{B,C}) => tr(A{B,C}) =0 


In particular, any mass term (without Higgs) 7%, requires a real representation (so 
its variation yields G + GT = 0); a pseudoreal representation won’t work because it 
uses an antisymmetric metric which, when combined with Cg, makes y7°w, vanish 
by symmetry (since ~ is anticommuting). A related way to see in Weyl (or Dirac) 
notation that real representations are nonanomalous is to use the same squared- 
propagator trick we used for the propagator correction in subsection VIIIA3 (or re- 
lated complex action from subsection HIC4), which resulted in simplified Feynman 
rules only for real representations: With those rules, there are no potentially diver- 
gent 3-point graphs other than those that already occur for scalars (as part of the 


covariantization of the propagator divergence). 


The absence of y_;’s is a special case of parity invariance. However, even par- 
ity invariance is not enough to enforce cancellation of anomalies, since some parity 
invariant theories have axial gauge vectors, which couple to axial currents Wy_)y,V, 
and the appearance of these y_;’s can be sufficient to introduce anomalies. In these 
anomalous cases, even if there is a C, the charge conjugation argument above does 
not apply because the C following from the usual CP and the obvious P does not 
simply replace A — —A’, but is some other permutation of similar representations. 
Thus, in general P (and C) invariance is unrelated to anomaly cancellation: We can 
have one without the other. Having real representations (i.e., no 7_1’s) is a special 
case of both. 


Exercise VIIIB3.1 
Consider chiral symmetry (as in subsection [VA4 or IVB1) for a single flavor 


— U(1),®@U(1)r. Now gauge that symmetry: 


a In Weyl spinor notation, write the action for massless Weyl spinors Wa, WRa 
each coupled to their own gauge vector. Clearly there is one anomaly for 3 
external A;,’s, due to wz, and another for Ar,, due to wr, and no mixing. 
Now assume the left and right coupling constants are equal (so the anomalies 
are equal). Write the resulting symmetry transformations on all fields under 
Lr, G, end F. 
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b Rewrite this theory in Dirac notation. Using P, find the combinations of A, 
and Apr that are (polar) vector and axial vector. Relate the anomaly cal- 
culations in the two notations. Show that dropping the axial vector gives 
(massless) QED. Find the theory that results from dropping the vector in- 
stead: Give the gauge symmetry, and show it is anomalous, and explain the 
anomaly (vs. the cancellation of the anomaly in QED) in both Weyl and Dirac 


language. 


c Generalize all the above results to U(n),®U(n)r. (Note that C will now 
include complex conjugation on the hermitian matrices for the vectors, so 
that P won’t.) 


The simplest way to prove anomalies cancel in the Standard Model is to use our 
previous results for GUTs (subsection IVB4): (1) One way is to consider the GUT 
gauge group SU(4)@SU(2)@SU(2). First, we note that tr(G;) = 0 because the group 
is semisimple, so there are no mixed anomalies. Then we see that the SU(4) couplings 
are the usual “color”-type couplings, without 7_,’s (i.e., 464), so it has no anomalies. 
On the other hand, SU(2) has only (pseudo)real representations, so neither SU(2) 
has anomalies. Thus, anomalies cancel in the SU(4)®SU(2)@SU(2) GUT. Finally, 
breaking to SU(3)@SU(2)@U(1) (which also spontaneously breaks parity) leaves an 
extra singlet per family, which decouples, showing the cancellation for the Standard 
Model. 


(2) Another way is to start with SO(10), which is anomaly free for any represen- 
tation of fermions: 


tt Gags {Giea, Gert) = 0 


simply because there is no combination of Kronecker 6’s with the appropriate sym- 
metry (and similarly for SO(N), except for N=2 or 6, where such a term can be 
produced with the € tensor). Breaking to the Standard Model again drops just 
a singlet (as does breaking to SU(5), showing its anomaly cancellation; breaking 
to SU(4)@SU(2)@SU(2) drops nothing, again showing its cancellation). In general, 
proving anomaly cancellation requires (a) using such arguments about real represen- 
tations, or (b) the absence of anomalies for certain groups (namely, only SU(N) for 


N>2, or U(1), can have anomalies), or (c) explicitly calculating the relevant traces. 


A. 79 —+ 24 


When an anomalous axial symmetry appears only as a global symmetry classi- 


cally, unitarity is preserved, since no gauge field couples to that current. This can be 
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a useful way to explain approximate global symmetries. The fact that the anomaly 
is always a total derivative (because of the € tensor and the Bianchi identity for F’) 
means that the global symmetry is not broken perturbatively. (However, when the 
external vectors are nonabelian, there can be contributions from field configurations 
like instantons: See subsection HIC6.) In subsection IVA4, we saw that the neutral 
pion (7°), the lightest hadron, could be considered as the pseudogoldstone boson of 
an axial U(1) symmetry. We also want to consider the pion as a bound state of a 
quark and antiquark: If we knew the wave function, we could write the coupling, 
and calculate directly the decay of the neutral pion into two photons (7° — 27) via 
quark-antiquark annihilation, or at least find the leading low-quark-energy contribu- 
tion from the d-function part of the wave function (in the relative coordinates of the 
quark and antiquark), corresponding to the coupling to wy_,W. (An expansion of 
the wave function in derivatives of the 6 function would give coupling to currents 


containing derivatives.) 


Lacking such detailed information, the best we can do is extend the nonlinear a 
model approach, which is to look for the terms in the phenomenological Lagrangian 
(expressed in terms of composite meson fields, not fundamental quark fields) with 
fewest derivatives (i.e., those most important at low energy), applying the condition 
of (approximate) chiral symmetry. Specifically, the global axial symmetry 7’ = 7—20, 
where A’ = A for the photon field, along with the electromagnetic gauge invariance 
for A, under which the neutral pion field is invariant, would suggest couplings of pion 
to photon involving only Oz and F. 

However, the anomaly allows the existence of another term: Since by definition 
(from considering coupling to an unphysical axial gauge field) the anomaly is given 
from a local axial transformation, while the pion field transforms in a trivial way 


under this transformation, we can attribute the anomaly to the pion coupling as 


on = —20, r=- | 00-J, PenAr. oar) =0 


1 
_ 1 _ ab---cd 
=> t= [90-3 = | way dae nee 


Thus, in four dimensions we find the contribution 
In= [het PaP 


Using the abelian form of the Chern-Simons form (subsection IHIC6), we also can 


write this as 


In=- / 56,7) Boed 
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(In the nonabelian case, we can neglect the surface term only if the vacuum value of 
m has already been subtracted.) Adding this term to those found previously (the 7 
and A kinetic terms, as well as the quark terms that define the normalization of the 
nm field through its coupling), the decay rate for 7° — 2y can be calculated (including 
the 2 relevant flavors of quarks, and 3 colors, using the values of their electromagnetic 


charges), and is found to agree closely with the experimental value. 


Exercise VITIB4.1 
What is Ig in D=2? What is the interpretation of the pion field in terms of 
the fields of the Schwinger model (subsection VIIIA7)? 


The global anomaly in the nonperturbative case can be applied to the strong 
interactions (QCD), although not as straightforwardly: Considering the external vec- 
tors to be gluons (so there is an implicit trace above over the group indices), I gives 
a coupling of a neutral meson to a pseudoscalar glueball, as discussed in subsection 
IC4. If the vacuum gives a nontrivial value to tr(e%“F,,F.q) (as for instantons), this 


also leads to anomalous CP violation in the strong interactions. 


5. Vertex 


One-loop triangle graphs can’t be evaluated in terms of elementary functions. 
However, in QED the most important effects are at low energy. We therefore will 
evaluate the effective action in the quantum mechanical version of the JWKB ex- 
pansion, as an expansion in derivatives. The resulting approximation to the effective 
action thus will be local, but include terms of higher dimension than the classical 
action, whose coefficients are therefore finite and unrenormalized: By dimensional 
analysis, this means their coefficients will have powers of the inverse electron mass, 
which can be considered as the expansion parameter. (See also subsection VIIB8, 


where a scalar 1-loop vertex divergence was evaluated.) 


The propagator corrections have been found already in subsection VITIA1; now 


we calculate the vertex correction. The integral is 


Na= VRB + BS )v(K+b+ mw, D= gk’sl(k +p’)? +m] 5[(k +p)? +m’) 
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Without loss of generality, we can drop terms that vanish by the free fermion field 
equations; this corresponds to canceling them by fermion field redefinitions. We then 
evaluate the numerator by applying the identities 


p=p?=—-m’, g=p-p => (pt+p)=-4m’?-¢? 


pb = poy — of 
as well as the identities of subsection VIC4 for 7’...7, and the field equations p = ye} 
on the far right and p’ = 5 on the far left, to obtain 


N= (K+ BD VE+P) + 7+ BQk+ pt’) 
(K+ DVE+B) =F D+P E+ D+ P) — byd— Be — Bp — kod — Bk 
= [5k? +k: (ptp') —2m? — 3q?]y—(k+-p+p')\(E+ 225) +m? + B(pt+p!)— y+ Sek 
=> N= (5k?7— kk) +[k- @tpy— (ot pk + Sek] — (mi? + 5’) 


For the momentum integral we evaluate 
tke 


A3(x,m’, q”) = a = = fer ee 


f= Bye" + ix + $[(a1 + a2)(p +p’) + (a1 — a2)g] + ZA[(a1 + a2)?m? + ayA2q7] 


again on the fermion mass shell. We also have 


oo 1 oo 1 l-ay 
fer=f[ ar f @as(i-Ya)= farm fdas fdas 
0 0 0 0 0 


using, e.g., the definitions 


/ dee 6() f(«) = 6(—a)6(b) f(0), / ” dx O(a — a)6(b — 2) f(x) = / dx f(a) 


As for the fermion propagator, we clearly separate UV and IR divergent integrals by 


the changes of variables 


love) 1 Qa 
a=a,+ a2, B=a,-a, => fer=f am f da} f dQ 
0 0 —a 


followed by 


love) 1 1 
ae, ee, ae fern f am f daa? s | d 3 
a? (0) 0 —1 


which modifies the integral to 


ioe) ik 1 
A3 = | dX xf da a1 sf dB e® 
0 0 = 
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B= t{ta°2? +iax-(p+p' + Bq) + Alm? + 4(1- B’)@]} 


We now expand to O(a”) and O(q’). The @ integral is then trivial (the integrand 
becomes quadratic in 3), the A integral gives the usual, and the a integral is similar 


to the case of the fermion propagator. The result is 


2 2 
oe q q Lge 


1 
+ i(a-(p+p'))? + jee + H(x-q)? + ys} 


This leads to the expression for the vertex correction 


Lae ae ee eee eee 
As3,qep © I'(1+ €)(5m*) 1S ee ee ee ~ 


m) err = 7 euy 
41(4_1%)\ pte 
4 Ane m/V/2 
where we have used 
¢=2 


in evaluating the contribution from the k? term. (Remember that all algebra from 
indices on the fields is done in 4 dimensions, while all algebra from indices on momenta 
is done in D dimensions. Since the two parts of the calculation are usually done 
separately, this should cause no confusion; however, the difference in evaluating 0% is 


the main thing to watch.) Using the on-shell identity 


Ate = {P+ P+ ld = —-@ +P) + lt 


we can rewrite this as (again keeping only O(q’)) 


1,2\-e i i 1 5 il 1 q 
A3,qep © I'(1+€)(5m*) — +53 +5] + 35,7 4 —s | 7 
IR 


2 
EIR EUV m? 


4 
Ale 

i 

2 

“~_—_—E—— 


The next step is to cancel the UV divergence by adding the counterterm for 


electron wave-function renormalization from subsection VIIIJA1: 


if 1 m? 1 g 1 
A3.geD,r = As,.gep + 0A3,QED © 3 a —In ee | a sae) 


ld. 7 
m/ V2 


Equivalently, we can take the gq = 0 piece of A3.gzp, and note that it combines with 


A». of subsection VIIIA1 to gauge-covariantize the term proportional to p > p+ A. 
(The unrenormalized effective action is thus automatically gauge invariant, as is the 


counterterm.) At this point we can see the anomalous magnetic moment: Combining 
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the tree and 1-loop result (including coupling), and writing as spinless + magnetic 


moment contributions, we have 


1 m? q ptp ld, 7] 
2 x <i 2/1 ( — _ Jy — 1) 4 4 ¢_2 1+e7)i 42+ 
y+e°As.gen, { +e E (= In m) +4] maT aia” ae er 


We can translate these 1-loop corrections into a contribution to the effective action 


as (with the usual —1 for the effective action) 


I, 3,9eD,9 = —Y(—p')As,QK0,-¥ (p) - A(q) 


We then note, again using the spinor (free) field equations to imply (p+ p’)-q = 0, 
to O(7’), 
459° qiaAy © a"(P + P')qaAy = 9°(p +p!) A 


The low-energy part of the renormalized effective action exhibiting up to order q?/m? 


corrections to the coupling is then, in gauge invariant form, 


2 
We 5 m € ate) 
luiser fe Yio A+ & amis oP, 


e? 1 m? Sak 
—_ E (=~ in) +] (0 Fa) bw 


Exercise VITIB5.1 
Perform the supergraph version of this calculation: a massless Abelian vector 


multiplet coupled to a massive chiral scalar multiplet. 


6. Nonrelativistic JWKB 


As for other processes, the application of quantum field theory to bound states has 
two steps: (1) Calculate the (gauge-invariant) effective action; (2) find solutions to 
the field equations following from the effective action (“on-shell” states). For bound 
states such solutions are nonperturbative; however, their determination is easier for 
nonrelativistic systems, since we can ignore production and annihilation of additional 
nonrelativistic (massive) particles in the second step because their effect already has 


been included as small corrections to the effective action. 


The Lamb shift is the (field theoretic) quantum contribution to the energy levels 
of the hydrogen atom, which is described accurately even at one loop. The relativistic 
solution is found by perturbing the relativistic effective action in derivatives about 


the nonrelativistic one, whose solutions are the usual exact ones of the nonrelativistic 
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Schrédinger equation. For atoms the electron speed p/m is of the order of a (= 27”), 


so the loop and derivative expansions are in the same small parameter. 


The effective action is more conveniently calculated with manifestly relativistic 
methods, since the internal (“virtual”) particles can be relativistic (especially those 
that contribute to the UV divergences). On the other hand, the solutions to the field 
equations are more conveniently calculated in a representation that takes better ad- 
vantage of the nonrelativistic expansion, since the external particles are nonrelativis- 
tic. Therefore, the second step begins by performing a field redefinition that converts 
the manifestly Lorentz invariant effective action to a form recognizable as nonrel- 
ativistic field theory with low-energy relativistic and loop corrections. (Originally 
the Lamb shift was calculated without this transformation. Higher-order calculations 
then required use of the relativistic Bethe-Salpeter equation, which made collection 
of terms of a given order more difficult.) In subsection HB5 we considered the gen- 
eralized Foldy-Wouthuysen transformation and its application to minimal coupling; 
we now apply it to the nonminimal coupling introduced by loop corrections. (In the 
literature this step has been performed on the Feynman diagrams themselves; how- 
ever, as usual we can save some effort by working directly with the effective action.) 
Here the nonminimal correction to the transformation is easy, since the nonminimal 


terms are already near the order to which we work. 


We first perform some dimensional analysis, using the fact that the leading be- 
havior is given by the usual nonrelativistic Schrodinger equation. Then the only 
parameters in units h = 1 (but there is no c in the nonrelativistic theory with just 
Coulomb interaction) are the mass m and speed e?, so (in the notation of subsection 
IIB5) 


mn’ ~ me?, n°? ~ me* 


(neglecting the rest mass contribution). It is then convenient to reorganize the ex- 
pansion in 1/m to relate to the expansion in e?: For example, we can identify the two 


by choice of units 


along with c = 1 (since we will include relativistic corrections). 


The relativistic form of the Schrédinger equation is obtained by multiplying 2° 
in front of the kinetic operator of the electron in a background electromagnetic field, 
as obtained from the effective action. Approximating the proton as infinitely massive 
(for which we can partially correct by using the reduced mass for the electron), we 
take the electric field as described by the usual static “scalar” potential, and drop the 


magnetic field along with the “vector” potential. 
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We therefore modify the expansion of subsection IIB5 by 
(1) reorganizing the 1/m expansion according to our dimensional analysis, 
(2) using only a static electric background, and 


(3) working directly in terms of y matrices: We can either plug in the Dirac case of 
the spin operators into the expressions of subsection IIB5, including the reality- 


restoring transformation of subsection IIB4, 
Gab = —tyleAl. Gola =5 = Lae 
or just expand the Dirac operator directly, 
29°(—f + Bh) = 0° — 2qy'n? + V2m7" 
(and similarly for the loop correction terms). 
From the reuslts of the previous subsection, we thus choose to order 1/m4 
Exg=V2y, Oo = 2y'p'y’, E, =mr° 
i [08 1 m* i [Oi 
O3 = —m*e* ni | aaa E, = me? E (= —In _ + i or 


(others vanishing), where we have included explicit m dependence so that the coeffi- 
cients €, and ©,, are of order m® according to our above dimensional analysis (so our 


expansion in m makes sense). Using 


tanh «x — za 


the relevant commutators from IIB5 are then, for the nonvanishing generators to this 
order 
mG = —3{[G, AE] + Lecoth(Le)O}E_1 


> Gy = = ZOE-1 
= — $([G1, €1] + 31G1, [Gi, Oo]])E-1 
— sO3E_1 


ee 
ol 


and for the transformed kinetic operator 


F' =E+tanh(sle)O 


> FL=&+4[G1,Op| 
Fs, = $1Gs, Oo] — H1G1, (G1, [G1, Ool]] 
Fy = Ea + 3[G1, Os] + 4[G4, Oo] 
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Remembering that €_; commutes with even and anticommutes with odd, we have 


identities like 
(Le.)"Oo = (—1)?™ V9 (Og)"*(E_1)”, (C4) =1 


Substituting for G into F’: 


Fi = E,+3(Op)E-1 
x = =10o, [Oo, E1]] ~~ £(Oo)*E_4 
= E+ ${Oo, O3}E_4 


The final result is, using 
(Oo)? = (p')?, me?(Op, E1] = —203E_1 


and setting €_; = —1 on the right for positive energy, 


1 m? a . —— 
Fi=me? 7 — —In— ) OF™ + ifS"{F™, p'} 
EIR pe? 
As expected from dimensional analysis, F; is the nonrelativistic result, F4 is the 
lowest-order relativistic correction, and Fj is the lowest-order part of the one-loop 


correction. Putting it all together, to this order we have 


Ppp (p')? — (p')? 1 g2( 1 m? ee 26 ca Oi 9 

td aga ne Teav) l= 3e ae ae OF +g ee} 
Exercise VIIIB6.1 

Find the additional terms in F’ to this order when the electromagnetic field 

is arbitrary (magnetic field, time derivatives of background), assuming the 


same dimensional analyis for the background. 


7. Lattice 


Integrals are defined as limits of sums. For some cases it can be convenient to 
define quantum theories on discrete spacetimes (“lattices”), perform all calculations 
there, and then take the limit of continuous spacetime. Two types of such lattices 
will be considered here: (1) Physical four-dimensional spacetime can be treated as a 


regular hypercubic lattice. Then the existence and uniqueness of a continuum limit 
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where Lorentz invariance is restored must be proven. (2) In first-quantization of par- 
ticles or strings, the worldline or worldsheet can be approximated as a random lattice 
(see subsection XIA7). Integration over the metric of the worldline or worldsheet is 
then replaced with summation over lattices with different geometries. The continuum 
limit is not required by physical criteria, but only for purposes of comparison to the 


theory as defined in the continuum. 
The use of a regular 4D lattice for quantizing QCD has three main advantages: 


(1) The lattice acts as a gauge invariant regulator for UV divergences (and, if the 


lattice is finite, also IR ones). 


(2) Gauge fixing is no longer necessary, since the path integral can be performed 


without it. 


(3) Nonperturbative calculations are possible, some analytically and some numeri- 


cally (if the lattice is small enough). 


Gauge fields are associated with translations through the covariant derivative. 
However, on a lattice, even a regular one, infinitesimal translations are no longer 
possible: For example, scalar fields are defined only at vertices of the lattice. We 


therefore consider covariantizing finite translations, as in subsections IITA5 and IIIC2, 


ek'Vm — P lew (-:f da! - A) CS wee 
a—k 


Without loss of generality, we can restrict ourselves to translations along links, from 
one vertex straight to an adjacent one (keeping all coordinates but one constant), 
and successive combinations of these. Then the gauge field is replaced with the group 
element U;,-, associated with each link, where k is now any of the 4 orthonormal 
basis vectors (in Euclidean space). The gauge transformation of this representation 
of the gauge field follows from either the path-ordered definition or the covariant- 


translation definition: 


eV! = gale ee) => Use» = 8(2)U e408 (e —*) 
Note that, while the gauge field is a group element associated with a link, the gauge 
transformation is a group element associated with a vertex. Furthermore, the field 
strength can be associated with the product of these group elements of the links 


bounding a “plaquet”: 


Ff dxz-A wipes PS x fic 
U0 Ue nek ewe he =P(e ia ) =e hVe PF VerVerv 


wy elhVIEV] 1 4 kk? Rk, 


556 VII. GAUGE LOOPS 


where we have used 


1 
B 5([B asi 
cB el = pBtC+aIB.Cl+ 


(In general, there is a geometric prescription associating a scalar with a point, a vector 


with a line, a second-rank antisymmetric tensor with a surface, etc.) 


We now define a gauge-invariant action by looking for an expression in terms of 
these group elements that approximates the usual Yang-Mills action to lowest order in 
the lattice spacing, while involving the least number of factors of the group elements. 


The result is: 


a —ttr So 5(ik*k Fy)? ~ Str) > F?(2) 
plaquets x 

(expanding the exponential as above to quadratic order, and noting that total commu- 
tators vanish when traced). Since our fields are now represented by group elements, 
we no longer need to fix the gauge to make the functional path integral well defined: 
In contrast to the continuum case, where integrating a gauge-invariant action over 
gauge transformations would produce an infinite factor, here such an integral at any 
one point is just an integral over the group space, which is finite (for compact groups, 
which have finite volume). The functional integration is now integration over U for 
each link, where the range of U is the group space (which is finite, since the group is 


compact). 


Matter can also be introduced: Scalars are naturally associated with vertices, 
just as vectors are with links, and second-rank antisymmetric tensors with plaquets. 
However, fermions do not have such a natural geometric interpretation. In particular, 
it has been proven (the “Nielsen-Ninomiya theorem”) that massless fermions can’t be 
defined in a useful way on the lattice without “fermion doubling”: There must be a 
multiple of 2? massless fermion fields for D lattice dimensions. This is closely related 
to the existence of axial anomalies: The absence of an anomaly is implied by the 
existence of a regularization that manifestly preserves a symmetry (in this case, chiral 
symmetry as a consequence of the existence of lattice-regularized fermions). However, 
massless fermions can be defined as limits of massive ones (so chiral invariance is not 
manifest). Alternatively, nonlocal spinor kinetic operators can be found that preserve 
masslessness and chirality without doubling. (The nonlocality can be controlled, but 


at the cost of a significantly more complicated action.) 
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Exercise XIC1.1 


For the lattice action for a spinor in D=1, use 
S= Fi) dant 


where w is a real one-component fermion. 
a Show this has the correct continuum limit. 
b Find the equations of motion. 


c Solve the equations of motion for both the lattice and continuum cases, and 


show the lattice has twice as many solutions. 


d Repeat all the above for the single-component complex (Dirac) fermion, 
S= Ce _~ Vn)Vn 


e Make the same analysis for the D=1 scalar, and show it has no such problems. 


Exercise XIC1.2 
In the book by Feynman and Hibbs, exercise 2-6 states rules for the path 
integral for a Dirac spinor in D=2. These rules are equivalent to the use of 
a lightlike lattice, where the lightcone coordinates are discretized. The rules 
are to consider all paths that are piecewise lightlike forward in time, with a 
factor of ime for each right-angle “kink” (where m is the mass and € the lattice 
spacing). Show these rules follow from the 2D action for a Dirac spinor (see 
subsection VIIB5, and include a mass term), using a term as in the previous 
exercise for the derivative term for each of the two component fields (each of 


which has a derivative in only one of the two lightlike coordinates). 


In general, fermions are more difficult to integrate over, particularly when using 
“numerical methods” (computers), since fermions are not numbers. In principle one 
can integrate out the fermions analytically to produce functional determinants in 
terms of bosonic fields, but nonlocality makes them hard to evaluate by iterative 
schemes. In practice fermion loops are usually ignored (“quenched approximation” ), 
which corresponds to leading order in an expansion in the inverse of the number 
of flavors, or the approximation of heavy quarks. The resulting accuracy of QCD 
calculations for low-energy parameters (masses of light hadrons, decay constants, etc.) 
is of the order of 5-10%. (Getting good numbers in nonperturbative calculations is 
significantly harder than in perturbative ones. The situation is expected to improve 
somewhat with the advent of faster computers.) Finding scattering amplitudes, or 


other properties that involve high-mass hadrons, is presently beyond the scope of 
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lattice methods. However, lattice QCD is one of the few methods so far to obtain 
numbers for comparison with experiment from nonperturbative calculations with the 
QCD action. (Other nonperturbative methods have also been restricted to low-mass 


hadrons, and basically study effects of chiral symmetry breaking, not confinement. ) 


The spacetime lattice allows a direct nonperturbative analysis of confinement. 
For example, consider the potential between a heavy quark-antiquark pair. The 
heaviness again allows us to ignore pair creation, and to treat the quarks as static. 
For simplicity, consider scalar quarks, as described by first-quantization. Since we 
approximate the quarks as static, the only relevant term in the quark mechanics action 
is the interaction term [dr @-A = [dx-A. Taking into account the nonabelian 
nature of the group, and ignoring the first-quantized path integration Dz (since z is 
assumed fixed), the factor e~* for the quark becomes just the path-ordered expression 
Pett 4A) ve have been considering, while for the antiquark we get the inverse 
expression. To get a gauge-invariant expression, we connect the paths at top and 
bottom, since the fields will be fixed at the boundaries at t = too. (Functional 
integration over any gauge-field link picks out just the singlet part of the integrand, 
since the integral is over the group, and nonsinglet representations can be rotated to 


minus themselves by an appropriate group element, canceling the contribution.) The 


ir P (cif 4) 


result is a “Wilson loop” 


The strong-coupling expansion is applied by expanding the functional integrand 
e~° in powers of $, which is an expansion in powers of 1/g?, and which is also an ex- 
pansion in the number of plaquets. Clearly the dominant term in this expansion is the 
one with the fewest factors of S. To be nonvanishing, each link variable must appear 
in a singlet combination: The function of that link, when expanded in irreducible 
group representations, must include a term that is proportional to the identity. For 
example, for any unitary group, this is true for the product U;/(U~1);!, where the two 
U’s are for the same link, and the indices are the group indices; this has the constant 
piece U;/(U~'),'. For the case of the Wilson loop, if we assume the simplest case 
where the path is a rectangle, then we need at least a factor of S for each plaquet 
enclosed by the loop, so there will a UU~! for each link on the boundary (one factor 
from the loop, one from S), as well as for each link enclosed by it (both factors from 


the contribution to S from either side). The result for the path integral is then 


A= (tr P a) evn ey 
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where V is the (potential) energy (S = [dt(V +7) in Euclidean space), t is the 
time separation between the top and bottom of the rectangle, and r is the spatial 


separation between the two sides. We thus have a linear quark-antiquark potential 
V(r) ~ (In g)r 


so the quark-antiquark pair is confined. 


Unfortunately, we can get a similar result from QED, by defining the U(1) group 
in terms of a phase factor (so effectively the range of group integration is 27, defining 
a “compact” group). The reason is that for this U(1) theory this strong coupling 
expansion is not accurate. The approximation is better for nonabelian theories, but 
the persistence of confinement has not been proven in the continuum limit (small 
coupling). In fact, while the transition to deconfinement in Abelian theories has been 
found at finite coupling, it has been proven that such a phenomenon can occur in 
the nonabelian theory only near zero coupling. However, the perturbative properties 
of the continuum theory show that this is exactly where one expects the appearance 
of ambiguities in the theory (known in lattice terminology as “nonuniversality”): As 
seen in subsection VIIC1, analytic continuation of the coupling near the positive real 
axis runs into trouble only near g=0. Although these problems might be resolved in 
finite theories, such theories require supersymmetry, which is difficult to treat on a 


lattice because of its problems with fermions, as discussed above. 
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The sign of the one-loop correction to the gauge coupling is opposite in QCD to 
that of QED: The photon coupling is weak at “low” energies (actually, any observable 
energy, since the coupling runs so slowly), while the gluon coupling is weak at high 
energies (with respect to the hadronic mass scale). Thus, typically perturbation in 
loops is used to study high-energy behavior of QCD, while the low-energy behavior 
awaits the discovery of a general nonperturbative approach. Although such an ap- 
proach is usually referred to as “perturbative QCD”, it is really a mixed approach, 
where amplitudes are generally factored into a high-energy piece, which is calculated 
with the usual Feynman diagrams, and a low-energy piece, which is found only from 
experiment. The “high” and “low” energy here refers to a parton that is liberated 
from a hadron, having low energy before and high energy after. In the processes that 
are best understood, this liberation is performed by an electroweak boson (photon, 
W, or Z), so one is actually calculating the electroweak interactions of a strongly 


interacting particle (quark), and its QCD corrections. 


1. Conformal anomaly 


Symmetries of the classical action that are violated at the quantum level are called 
“anomalous”. There are two major sources for such “anomalies” in renormalizable 
quantum field theory: (1) There are anomalies associated with the totally antisym- 
metric matrix €,..a,, called “axial” (see subsections VUIA7 and VIIIB2-4). When 
they occur, they are found in graphs with at least (D+2)/2 external lines. They are 
associated with graphs that have no divergences, yet require regularization. (2) The 
existence of divergences requires the introduction of a mass scale even in theories 
that are classically conformal. If anywhere, these show up at least in the most diver- 
gent graphs, the propagator corrections. Normally, both kinds of anomalies will first 


appear at one loop. 


When anomalies are associated with global symmetries, they provide a natural 
way to explain approximate symmetries, in the sense of the perturbative approx- 
imation. However, when they occur in local symmetries, they destroy the gauge 
invariance needed to prove unitarity. The latter type of theory therefore must be 


avoided by applying the condition of anomaly cancellation in local symmetries. 


We have already seen the appearance of the conformal anomaly in our renormal- 


ization of divergent loop graphs: The introduction of a renormalization mass scale 
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breaks the scale invariance of a classically scale-invariant theory. The simplest ex- 
ample, and generally the most important, is the one-loop propagator correction. If 
we examine only high-energy behavior, then we can neglect masses from the classical 


action. 


Using dimensional regularization, the generic effect on the effective action of the 
complete one-loop propagator correction is to modify the kinetic term of an arbitrary 
massless theory to 

LOK (= +8; ng) ’ 
iE ie 
where @ is an arbitrary-spin field that we have normalized ¢ — @/g for some appro- 
priate coupling g (like the Yang-Mills coupling if @ is the Yang-Mills vector), A is the 
classical kinetic operator, ~ is the renormalization mass scale, and (3; is a constant 
determined by the one-loop calculation. As long as (3; is nonvanishing (i.e., the theory 
is not finite) we can rewrite this as 
16K, no 


where 

M2 — pee VAs" 
is a renormalization-independent mass scale: Any physical measurement will observe 
g and yw in only this combination. A choice of different renormalization mass scale 
is equivalent to a finite renormalization of g?, such that M is unchanged. In the 
case where g is dimensionless (the relevant one, since we are studying the conformal 
anomaly), the coupling constant has undergone dimensional transmutation, being 


replaced with a dimensionful constant. 


Exercise VITIC1.1 
Show this is the case for massless (scalar) ¢° theory in D=6 from the explicit 


one-loop correction. 


If there is more than one coupling constant, things are more complicated, but 
the same phenomenon occurs: One dimensionless coupling is replaced with a mass. 
A particularly interesting case is pure Yang-Mills theory: Then we can write an 
important contribution to the effective action as 


where F' is now the complete nonabelian field strength, and 0 is the square of the 
covariant derivative. Since this contribution by itself gives the complete 1-loop con- 
formal anomaly, the rest of the 1-loop effective action is conformally invariant. (All 
its M dependence cancels.) 
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Note that the (one-loop) anomaly itself is local: If we perform an infinitesimal 
conformal transformation on the one-loop part of the effective action, this variation 
gives a local quantity. This is clear from the way this anomaly arose in dimensional 
regularization: If there were no infinities, there would be no anomaly, since the naive 
conformal invariance of the classical theory would be preserved at each step. However, 
to regularize the divergence we needed to continue the theory to arbitrary dimensions, 
and the theory is not conformal away from 4 dimensions. The scale variation of a 
4D conformal action in 4-2e dimensions is proportional to € times that action, as 
follows from dimensional analysis; this scaling can be associated with the nonvanishing 
(engineering) dimension of the coupling away from D=4. (Usually, we write the 
coupling as gu®, where g is dimensionless. The fields have engineering dimension 
independent of D, defined by the value in D=4: E.g., in V = 0+ A, A has the same 
dimension as 0.) However, the one-loop effective action is coupling independent; 
thus, when dimensionally regularized but unrenormalized, it’s scale invariant. For 
example, in the propagator correction discussed above, we get a regularized term 
—16,¢K (50) ~‘¢, which is scale invariant but divergent. On the other hand, the 
counterterm added to make it finite is from the 4D conformal action, and thus is not 
scale invariant in D44; so the breaking of scale invariance can be associated entirely 
with the counterterm. (Le., the anomaly coming from the renormalized, nonlocal 
effective action is equal to that coming from the infinite, local counterterm.) Since 
the counterterm is local, the anomaly is local. It’s also finite, since it’s proportional 
to € (from the variation) times 1/e (from the divergent coefficient of the counterterm). 


In our propagator example, we have 
=¢ _ O 
—26,¢K (30) e+ 4(Su") oko & bok ina? 


A similar situation occurs for the axial anomaly with Pauli-Villars regularization: 
After regularization, the anomaly comes entirely from the regulator graph, which is 
not only finite by power counting, but local in the infinite-mass limit because that is 


the zero-momentum (effective potential or JWKB) limit. 


There is a physical significance to the sign of the constant 3,. (We saw some 
evidence of this already in our analysis of renormalons in section VIIC.) Instead of 
thinking in terms of the renormalization-independent mass scale M, we can treat g 
as an effective energy-dependent (“running”) coupling, 


1 1 py 
son = th In-5 
Gp?) ge LP 
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In the case 3, > 0 the coupling gets weaker at high energy (“asymptotic freedom” ), 
while for @; < 0 the coupling gets stronger at high energy (until it reaches the Lan- 
dau ghost). (For low energy the situation is generally more subtle, since we usually 
have complications from physical masses.) For QCD, this weakening of the cou- 
pling at high energy allows the separation of an amplitude into a nonperturbative 
low-energy piece (describing the observed particles, the bound-state hadrons), which 
is determined experimentally, and a perturbative high-energy piece (describing the 
non-asymptotic, fundamental “partons”, gluons and quarks), which can be calculated. 
(This sometimes goes under the somewhat misleading name of “perturbative QCD”.) 
This strongly contrasts with QED, where the weakening of the coupling at low en- 
ergy means both fundamental particles (photons, electrons, etc.) and bound states 
(positronium, atoms, etc.) can be treated perturbatively, and the only experimentally 
determined quantities are the values of masses and the electron charge (coupling at 
low-energy). Thus, in QED one in principle can calculate anything, while in QCD 
one is restricted to parts of certain amplitudes. (Various nonperturbative methods 
also have been developed for QCD, but so far they have successfully calculated only a 
few low-energy constants, as used in 0 models, i.e., masses and low-energy couplings. ) 
Although experimental verification of these results is sufficient to confirm the QCD 
description of hadrons, a practical description of hadronic cross sections at all ener- 
gies would seem to require a string model that can incorporate behavior attributed 


to both strings and partons. 


2. ete — hadrons 


If quarks and gluons are confined, how can QCD be useful? QED is useful be- 
cause the coupling is small: e? & 1/861 is the perturbation parameter in relativistic 
(quantum field theory, or 4D) calculations, a = 27e? = 1/137 in nonrelativistic 
(quantum mechanics, or 3D). Energy levels of the hydrogen atom can be calculated 
quite accurately, without the question of freely existing electrons and protons coming 
up. The speed of the bound electron is also a, another way to understand why pair 
creation/annihilation and other relativistic or multiparticle effects are small, and can 


be treated perturbatively. 


Therefore, the real usefulness of a field theory depends not on how “physical” the 
choice of fields is, but how accurate the perturbation expansion is. “Nonperturba- 
tive” results may give some nice qualitative features, but they are ultimately useless 
unless they can be used as the basis of a new perturbation expansion. (Attempts 


at nonperturbative approaches to 4D quantum field theory continue, but so far the 
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results are meager compared to perturbative results, or to nonperturbative results in 


quantum mechanics or 2D quantum field theory.) 


The simplest application of QCD is to the production of hadrons by a photon 
created by the annihilation of an electron and a positron. The total cross section for 
such an event is given (according to the optical theorem) by the imaginary part of 
quark contributions to the photon propagator: Since hadrons are made up of partons 
(quarks and gluons), we assume a sum over hadrons can be written as a sum over 
partons. This assumption, that hadrons can be described by a resummation of the 
perturbation expansion, should be good at least at high energies, where the partons’ 
asymptotic freedom takes effect (and perhaps at lower energies by an appropriate 
extrapolation). To lowest order for the process under consideration this is a 1-loop 
graph, with a quark in the loop. If we compare this to the production of, e.g., muon- 
antimuon pairs (but not back to electron-positron pairs, because that includes the 
crossed diagram) by the same procedure, and we neglect masses (at high enough 
energies), then the only difference should be in the group theory: Hadron production 
should be greater by a factor of the number of colors times the sum over flavors of 
the square of the quark’s electric charge: 

_ Pete —h) 2 
Be Bete ey LM 


Experimentally this relation is confirmed for N, = 3, if the only flavors included in 


the sum are those with masses below the photon energy ((2m,)* < s). 


This result can be extended to the case where the momenta of hadrons are ob- 
served (not summed over): Although individual partons are not observed as asymp- 
totic states, the dominant contribution to the cross section at high energies is given 
by the conversion of the quarks into hadrons by the creation from the vacuum of 
parton pairs with energies, and angular deviation from the partons created by the 


photon, smaller than experimental accuracy. We treat all partons as approximately 
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massless, with respect to the energy scale of the photon. Thus, each parton created 
by the photon starts out initially as free, is then accompanied by parallel partons of 
small energy to form hadrons, and then these hadrons may further decay, but with a 
small angular spread with respect to the directions of each of the initial partons. Such 
collections of final-state hadrons are called “jets”. For high-energy electron-positron 
annihilation, the dominant hadronic decay mode of this off-shell photon is thus into 
two jets. This experimental result is further verification of QCD, and in particular a 
jet is the most direct observation of a parton. Of course, even for asymptotic states 
the directness of experimental observations varies widely: For example, compare a 
photon or electron to a neutrino. A closer analogy is unstable particles: For example, 
the neutron is observed as a constituent of the nucleus (as quarks are constituents of 


hadrons), but eventually decays outside (as quarks “decay” into jets of hadrons). 


A similar analysis can be applied to the creation of any electroweak boson by 


annihilation of a lepton with an antilepton. 


Exercise VITIC2.1 
Find the corresponding process (particles) for positron-neutrino annihilation. 
Find the expected numerical value of both this and the above R in the Stan- 


dard Model for energies well above the masses of all the fundamental particles. 


3. Parton model 


We have already seen that in quantum field theory coupling constants are usually 
energy-dependent. However, the dependence is only logarithmic, and thus can be 
treated as perturbative unless the relevant energy scale is within a few orders of 
magnitude of the mass scale that appears by dimensional transmutation. In QED, 
the value quoted for the electron charge is at the scale of the electron mass m. Using 
the result of subsection VUIA2 (or VIIIA3) for the 1-loop propagator correction, we 
find (neglecting higher-loop corrections) 

Mgep 


= e3/4 — 2.8380185(62) x 10° = Mogp = 1.4502244(32) x 10?”"GeV 


(where for fun we have included the 1-standard-deviation uncertainties for this 1-loop 
result as the figures in parentheses; the e in the exponent is the electron charge). 
Since the mass of the observable universe is of the order of 108° GeV, and the Planck 
mass (beyond which a particle will gravitationally collapse from its Compton radius 
falling within its Schwarzschild radius) is of the order of “only” 107° GeV, there is 
little worry of observing the QED Landau ghost, even if QED were correct to that 
scale. 
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On the other hand, the mass scale for QCD is (in the MS scheme) 
Macp = .217(24) GeV 


(This result depends on renormalization scheme, and is also an effective mass in 
the sense that the usual experimental energy scale is among the quark masses, so 
the high-energy approximation of the renormalization group is inaccurate, and the 
full propagator correction with quark mass dependence should be used. The above 
number is for above the bottom but below the top threshold.) This indicates that 
perturbative QCD is inadequate to describe properties for which the energy of the 
quarks is low, such as hadron masses (although nonrelativistic quark models have had 


partial successes). 


Exercise VITIC3.1 
Take masses into account in the simplest approximation: Treat particles as 
massless for energies above (twice) their mass, infinitely massive for ener- 
gies below. Approximate the masses of Higgs and superpartners of Standard 
Model particles as about the mass of the Z boson. Then graph the strong 
coupling 1/g? in the supersymmetric Standard Model (see subsection VIIIA4) 
as a function of the /n of the energy from the Grand Unification scale down 


to where it vanishes (g = 00), Mgcp. 


However, in certain processes a single “parton” (quark or gluon) in a hadron is 
given a high energy with respect to the other partons, usually a quark by electroweak 
interaction. In those cases, the “strong” (chromodynamic) interaction of that par- 
ton with the others in its original hadron is negligible: It has been liberated. The 
approach is then to factor the amplitude into a piece with the electroweak and high- 
energy (“hard”) chromodynamic interactions of this parton, which can be calculated 
perturbatively, and the low-energy (“soft”) chromodynamic part of the remaining 
partons, which is left as an unknown, to be experimentally determined. (Thus, the 
hard part is the easy part, while the soft part is the difficult part.) The predictive 
power is thus limited to the dependence of the amplitude on the energy of this parton, 


and on the particulars of the electroweak particles involved. 


Another possible complication would be the effect of exciting many partons within 
a hadron, indirectly through the first parton’s interactions with the rest: Then one 
would have several terms to sum in an amplitude, each with a different unknown 
soft factor, making the approach useless. Originally, it was thought that the high 
energy alone was enough to explain the parton acting as free once liberated from the 


hadron (based on “intuitive” arguments), but soon it was realized that this possibility 


568 VII. GAUGE LOOPS 


depended totally on the high-energy behavior of the theory: It requires the decrease of 
the coupling with increasing energy, asymptotic freedom (or superrenormalizability, 
or finiteness with effective asymptotic freedom through the Higgs effect). Based on 
this property, one can show from the usual perturbation expansion that one soft factor 
(per each hadron with an excited parton) is sufficient as a leading approximation, a 
property known as “factorization”. This feature is a consequence of the fact that the 
dominant contributions to Feynman graphs in this high-energy limit are those where 
the values of the momenta of some of the partons are those corresponding to their 


classical mechanics, as described in subsection VC8 and VIIA6. 


This new approximation scheme is effectively a perturbation expansion in the 
inverse of the energy being channeled into this parton. One neglects terms that are 
smaller by such powers (including those from masses and renormalons), but incorpo- 
rates logarithms through the renormalization group and other loop corrections to the 
hard factor. Since available energy scales are much nearer to Mgcp than to Mgzp in 
QED, such an approximation scheme tends to break down around two loops, where 
the corrections compete with the neglected terms, ambiguities in renormalization 
schemes, and the relative size (convergence) of successive terms in the expansion. Al- 
though the accuracy of the predictions of this approach cannot compare numerically 
with those of QED, it is the only method to describe such processes that can lay 
claim to being a theory, and provides direct experimental evidence of the validity of 
QCD, both as a qualitative description of nature and as a valid perturbation scheme. 
(As in the previous subsection, we also have processes where all the partons appear 
only in intermediate states, or effectively so for final states in total cross sections via 


the optical theorem, so factorization is unnecessary.) 


The most effective application of factorization is to “Deep(ly) Inelastic Scattering 


(DIS)”. (An equivalent method for this process is the “operator product expansion” , 
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but unlike factorization there is no useful generalization of it to general processes. ) 
In this process a high-energy photon (or intermediate vector boson) is exchanged 
between a lepton (usually an electron) and a quark. (This is the leading-electroweak- 
order interaction of a lepton with a hadron.) The quark and rest of the hadron 
do not interact again: Color singlets are obtained by the creation of soft partons 
from the vacuum, which split from their own singlets and eventually combine with 
the separated quark and hadron. For this process one calculates only the total cross 
section, at least as far as all the strongly interacting particles are concerned ( “inclusive 
scattering”) but again this can be generalized to the observation of jets (“exclusive 
scattering”). Applying the optical theorem, and ignoring the leptons, the leading 
contribution to this process is given by the tree graph for scattering of a vector 
boson off a quark, where the intermediate quark has a cut propagator. This is the 
perturbatively calculated hard part, which is later attached to the soft factor. Thus 
the hard part is the lepton-parton cross section, while the soft part is the “parton 
distribution” , giving the probability of finding a parton in the hadron with a particular 
fraction € (> 0, < 1) of its momentum p. To leading order this fraction is determined 
by kinematics: Since the hadron and scattered parton are treated as on-shell and 


massless, 


2 2 
r= Co =) |S _=2S =, — 
( ) an 


so the “(Bjorken) scaling variable” x is a useful dimensionless parameter even when 
(at higher orders) € # x. The energy scale is set by the square of the momentum q 


of the vector boson. 


There are several approximations used in this analysis, all of which can be treated 


as the beginnings of distinct perturbation expansions: 


(1) The hard part is expanded in the usual (loop/coupling) perturbation expansion 
of field theory. The leading contribution is that of the naive (pre-QCD) parton 
model (“leading order”), where the quark that scatters off the photon is treated 
as free with respect to the strong interactions. One-loop corrections (“next-to- 
leading-order” ) introduce the running of the coupling associated with asymptotic 
freedom, which justifies the validity of the parton picture. This is usually the 
only perturbation expansion considered, because such corrections are logarith- 
mic in the energy of the exchanged parton (rather than powers), and thus more 
important and easier to isolate from the data. Furthermore, by the usual renor- 
malization group methods such logarithmic corrections can be reduced by careful 
choice of renormalization scale (i? close to q? in In(q?/?)). Two-loop corrections 


lead to various ambiguities, and have not proven as useful yet. In particular, the 
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G function is scheme-dependent past two loops, making the dependence on the 


separation between hard and soft harder to fix. 


In calculating the hard part “light” quarks are approximated as massless. One 
can rectify this by also perturbing in the masses, as a Taylor expansion in the 


square of each mass divided by the square of the vector boson’s energy (m?/q?). 


In the explicit calculation the momentum €p of the excited parton is assumed to 
be proportional to the momentum p of the initial hadron. In the rest frame of the 
initial hadron (which is massive in real life), this corresponds to the nonrelativistic 
approximation of motionless quarks; one quark is then set into relativistic motion 
by the photon, liberating it from the hadron. Thus the parton model simulta- 
neously uses a nonrelativistic approximation for a parton before it’s scattered, 
and an ultrarelativistic (near-speed-of-light) approximation after it’s scattered. 
This nonrelativistic approximation can be corrected by a JWKB expansion (ex- 
pressed in operator language, the operator product expansion), also known as 
an expansion in “twist” (effectively, the power of momentum transverse to p). 
However, this means a separate soft part for each term in the expansion: Since 
these are determined experimentally, such an expansion would lead to a loss of 
predictability. Thus generally (with few exceptions), parton model predictions 
are restricted to high enough energies (q*) that such corrections can be neglected. 
In this sense, this approach is very similar to low-energy approaches to hadronic 
physics, e.g., nonlinear o models: Useful results are obtained at lowest order for 
describing physics in a certain energy range, but outside that range the increas- 
ing loss of predictability, e.g., nonrenormalizability, makes the approach less and 
less applicable. (Another, related, similarity between this approach and nonlinear 
a models is that both were originally described in the language of the operator 
product expansion, as applied to currents. However, this language was later re- 
placed in both cases because of the difficulty of evaluating operator products of 


more than two currents. ) 


Expansions in renormalons (see subsections VIIC2-3) introduce new coupling con- 
stants, effectively nonperturbative corrections to the otherwise perturbative hard 
part. Like all but the first of these expansions, this leads to correction terms that 
are down by powers of 1/q?. This type of correction could be absorbed into the 
previous one, since in principle the soft parts should contain all nonperturbative 
corrections by definition. However, this would be begging the question, since it 


would mean more parameters to be determined by experiment. 
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The other common application of the parton model is to “Drell-Yan scattering” : 
In this case two hadrons scatter producing, in addition to hadrons, a photon (or other 
electroweak boson) that decays into a lepton-antilepton pair. To lowest order, the 
relevant diagram is the same as for DIS (crossing some of the lines). Because both of 
the initial particles are hadrons, 2 soft parts are required; however, each of these is the 
same as that used in DIS (“universality”) so they do not need to be redetermined. In 
fact, there is a direct progression from ete~ to DIS to Drell-Yan: The above diagrams 
are similar except for the number (0 — 1 — 2) of soft parts (corresponding to the 
number of initial hadrons); the leading contribution comes from the same diagram, 


rotated to various positions (crossing). 


More generally, we can consider not only hard parts involving identified quarks 
in the initial state of the hard part, but also in the final state, by examining jets. 
Thus, for soft parts we have not only the “parton distribution functions”, which are 
energy-dependent probabilities to find specific partons in specific hadrons, found from 
amplitudes for an initial hadron — parton + anything (summing over anything), we 
have “fragmentation functions”, which are probabilities from amplitudes for parton 
— final hadron + anything. In principle these are related by crossing symmetry: 
The diagrams are similar to the previous, with the partons connecting to the hard 
part, but the external hadron lines may be either initial or final (and the opposite 
for the corresponding parton with respect to the hard subgraph). As for the parton 
distributions, the fragmentation function for any particular parton and hadron is 
measured in one particular experiment, then used universally. (The simplest is deep 


inelastic scattering for the parton distribution, and ete~ annihilation with one of the 
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two jets — hadron + anything for fragmentation.) Then the cross section is generally 


of the form 
dora..n => f dkn-A&y Faalba) “Fol s)do, Gs) 
a...b 


where do 4...p is the observed (differential) cross section, a...b (not to be confused with 
vector indices) label the different partons and A...B their hadrons (we leave off the 
labels for non-strongly interacting particles), the sum is over different kinds (and fla- 
vors) of partons (and perhaps over different hard parts, if corrections down by powers 
are desired), €, is the momentum fraction for parton a of hadron A’s momentum, fa 
is either the parton distribution function for A - a+ X (X = “anything”) or the 
fragmentation function for a > A+X, and dog...» is the hard cross section (calculated 
perturbatively), which is just the original with all the hadrons replaced by partons. 
For the parton distributions we integrate fo d€, while for fragmentation we integrate 
f = dé, or change variables to the hadron’s fraction of the parton’s momentum ¢ = 1/& 
and integrate te dé. 


Note that, while physical cross sections are independent of the renormalization 
mass scale pu, the same is not true of the hard cross sections calculated perturbatively 
in the above factorized expressions, since they are expressed in terms of unphysical 
quark “states”. However, these hard parts satisfy renormalization group equations, as 
calculated in the usual perturbative way. (Of course, nontrivial contributions require 
calculating beyond leading order.) This implies corresponding renormalization group 
equations (see subsection VIIC1), the “evolution” or “Griboyv-Lipatov-Dokshitzer- 
Altarelli-Parisi (GLDAP) equations”, to be satisfied by the parton distributions, so 
that js dependence cancels in the complete cross sections. This determines the energy 


dependence of the parton distributions. The equations take the form 


1 
we faolG 02) = ey = fav(G, H”) Pha (5.72) 

where f4, describes A > a+X, f4y describes A — b+X’, the “splitting functions” Py, 

describe b — a+X” (X = X'+X"), and the sum is over the intermediate parton b. For 

hadron A with momentum p, the intermediate parton 6b has momentum ¢p, and parton 

a has momentum €p, so €/¢ is a’s fraction of b’s momentum. The kinematics are such 

that 0 <4 <€<¢ < 1 (momentum is lost to X’s as A > b > a). The splitting 


functions can be calculated perturbatively from the corresponding renormalization 


group equation for the hard part, since the combined jz dependence must cancel 


in the physical cross section. Specifically, one considers the same equation with A 
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replaced by another parton: Since P,, is independent of A, it can then be found from 


a completely perturbative equation. 


For similar reasons, the hard cross sections are infrared divergent; the soft parts 
of the complete cross sections deal with low energies. This leads to complications 
beyond next-to-leading order, due to the fact that the renormalization group scale p, 
which relates to ultraviolet divergences (high-energy behavior), and the “factorization 
scale”, which relates to infrared divergences (it determines the division between hard 
and soft energies), are in principle independent scales. This allows an ambiguity in 
factorization prescriptions, in addition to the usual ambiguity in UV renormalization 
prescriptions. (In more general processes there can be other energy scales than just 


q’, each with its own factorization scale, further complicating matters.) 


4. Maximal supersymmetry 


The results of loop calculations simplify when the amount of supersymmetry is 
increased; in particular, more things vanish. We have already seen this with respect 
to divergences in subsections VIIIA5-6. Furthermore, in the massless case, vanishing 
of propagator divergences implies vanishing also of the finite parts of propagator 
corrections, since the unrenormalized corrections are always proportional to LI ‘/e 
(except in D=2 from anomalies: see subsection VHIA7). These simplifications make 
supersymmetric theories useful models; if supersymmetry is used to eliminate the 
renormalon problem, these results are also physically relevant. (We have already seen 
that supersymmetric methods are useful to derive nonsupersymmetric results for tree 
graphs, where unwanted particles can decouple. Similar results can hold in 1-loop 
graphs, where supersymmetric results can be used to trade particles with spin in the 
loop for scalars, which are easier to calculate.) In this subsection we will examine this 
behavior in 3- and 4-point functions. We will find cancellations from just algebra, 
without momentum integration. In general our analysis will apply to any D < 10 


(since super Yang-Mills doesn’t exist in D > 10). 


Specifically, we will calculate amplitudes for external gauge bosons, using the same 
methods as for propagator corrections in subsection VIITA3. There we found a unified 
kinetic operator in background Yang-Mills, allowing us to separate the coupling into 
spinless (covariant 1) and spin (F'°S,,) pieces. In general the contributions will differ 
in form depending on the number of S vertices that contribute, so we will require 
separate cancellation for each case. We now consider these contributions order-by- 
order in S, but with an arbitrary number of non-S' vertices (i.e., whatever number 


is needed to give an n-pt. graph for whatever n we are considering). Since the no-S 
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terms are by definition spin-independent, they cancel for supersymmetric theories. 
(The ghosts, together with the —1/2 factor for spinors, guarantee that the trace for 
spin gives the supertrace for physical degrees of freedom.) Cancellation of S' terms is 
trivial, since tr S' = 0 (trace in states, not indices). As we saw in subsection VIIA3, 


tracing the S* terms gives the usual Casimir of SO(D): 


0 for spin 0 

tr(SapSea) = NofeNdja X ¢ ztr(Z) for spin § 

Z for spin 1 
Its cancellation fixes the number of spinors to be that of maximally supersymmet- 
ric Yang-Mills. The $° terms (in 3-point and higher graphs) can be separated into 
tr(S[S,.$]), which reduces to $? (already canceled), and tr(${5,S}), as in 4D anoma- 
lies for internal group SO(D). The latter could give « terms, but only in D = 2 or 
6. We would miss them because of our (parity) doubling, but such parity invariance 
occurs anyway in maximally supersymmetric Yang-Mills in D < 10. The net result 
is that the 1-loop graphs completely cancel at less than 4-point, and the 4-point con- 
tribution comes completely from $* terms, but only for maximally supersymmetric 
Yang-Mills. (For these calculations we needed only the same field content, but su- 
persymmetry will then kill these lower-point graphs with fields other than Yang-Mills 
externally.) The amplitude is then proportional to F“4 times a scalar box graph. This 
F* factor turns out to be the same one (including Lorentz index structure) that ap- 
pears in the tree amplitude (although the tree factor requires much more work to 
derive, except when one uses methods specific to D=4, even though it is the same as 


for pure Yang-Mills). 


For example, representing group theory by the ’t Hooft double-line notation, let’s 
look specifically at the graphs with 2 external fields on one line and 2 on the other. 
Then we get tr(F'F’)tr(F'F) for the internal symmetry traces. By Bose symmetry, the 
4-Lorentz-index color singlet tr(F'F’) can consist only of the symmetric part of the 
direct product of 2 2-forms, i.e., a tensor with the symmetry of the Riemann tensor 
(Young tableau a 2x2 box, including traces) and a 4-form (single column of 4). First 
consider the case where the 2 F’s in a trace are adjacent on the loop. Then the form 
of S,» for spin 1 requires its spin trace always to give traces for those 2 F’’s, and thus 
contributes only to “graviton” (traceless symmetric tensor) and “dilaton” (scalar) 
type couplings to these singlets. On the other hand, the form of S for spin 1/2 always 
gives forms (times traces) for these 2 F’s, and thus contributes only to dilaton and 
“axion” (4-form) type couplings. (In D=4 this axion is the usual pseudoscalar; for 
the relation to the usual string 2-form by duality see subsection XIC6.) For the case 


where the 2 F’s are not adjacent, we commute the corresponding S’s so that they are: 
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The commutator terms give traces of fewer S’s, which cancel from our conditions, so 
the result is the same. (In the string case the momentum integral is different, so these 
graviton, dilaton, and axion couplings actually couple to those states, appearing as 


poles: see subsection XIC6.) 


Exercise VITIC4.1 
Work out all the explicit Lorentz and group theory traces for all graphs. 


Exercise VITIC4.2 
For D=4, reproduce these results using the methods of subsection VITIA6, 


thus automatically including external fermions. 


Similar methods can be applied when coupling gravity externally, essentially by 
replacing the Yang-Mills generators by a second set of spin operators, and the field 
strength with the Weyl tensor W. (Without loss of generality, we can drop the Ricci 
tensor, since it vanishes by the free field equations, allowing us to drop other terms, 
not of this form, that might appear. These tensors are discussed in subsection IX A4.) 
This is not surprising if we know string theory, where gravity vertex operators are 


obtained as the product of left and right-handed Yang-Mills vertex operators. 


Explicilty, if we perform the same procedure for external gravity as for external 
Yang-Mills (background gauge fixing, squaring fermion kinetic operators), we find a 
universal kinetic operator 


Oo a Wee Sap ce 


when coupling to spins 0, 1/2, 1 (or arbitrary forms), 3/2, and 2. (For spins 3/2 and 
2, we need to introduce “compensators” for local S-supersymmetry and Wey] scale 
symmetry: see subsections IXA7 and XA3.) In particular, the WSS term vanishes 
for spins 0, 1/2, and 1 (and forms) by explicit evaluation. (W is traceless and has no 
4-form piece.) 

We can then write all these spins as linear combinations of direct products of just 
spins 0, 1/2, and 1: 

Sa =s2 5 


But when we plug this into the W S'S term above, only the cross terms will contribute. 


Thus the kinetic operator becomes 
abcd OL GR 
O4+ swe gs Se 


Consequently, all vertex operators for all interesting spins (as in supergravity) can be 


expressed as direct products of those for spins 0, 1/2, and 1 (as for super Yang-Mills). 
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(Note that p’ = p®, and the connection term in the covariant derivative becomes 
wa? (She — Si)-) 

For example, taking the direct product of left and right maximally super Yang- 
Mills of the same chirality, we find the simple result W*, with no lower-point diagrams, 


for maximal supergravity (twice the number of supersymmetries of maximally super 
Yang-Mills). 


5. First quantization 


In subsection VIIIB1 we found some simple low-energy results for gauge loops 
applying JWKB methods. The approach was essentially quantum mechanical, us- 
ing the Hamiltonian formalism. Here we use the quantization procedure in a more 
explicit way: We work now in the 1D Lagrangian formalism, using 1D propagators 
and vertices, for calculating complete loops. This method will be the most useful one 


when applied in chapter XI to 2D Lagrangians for strings. 
Vv Vv Vv 
i i i 
I I I 
! ! i 
i i i 


1/Ho 1/Ho 


The propagator 


1 i 1 L 1 


ie. tee a 
Be eee Cea 7 eee, | Mee 
H H,=V ie” as 7 


—V — 
Ay Hy Ho Ho Ho 
gives the N-point graph 

1 1 


1 
Vino. Vopr Valha) 


ee eae 
n = (—kw|Vn “TT, E, 


(We can see see this from the usual Feynman diagrams, or by the relativistic gener- 
alization, along the lines of section IIIB, of the nonrelativistic quantum mechanics of 


subsection VA4, especially exercise VA4.1.) Restricting all states to scalars, 
Ho = 3(p? +m’), VYi=ge* 


The initial and final states can also be defined by the same vertex operators that 


created the external states: 


pik) = klk) => |k) = e***|0) 
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(We have used —ky so all states propagate inward: 5>k = 0.) 


We now translate this to the Heisenberg picture by absorbing the free propagators 


1/Hp into r dependence for the V’s: First we introduce Schwinger parameters, 


— = are 
aly 0 


=> Ay = [ dN 37(—ky|Vy_1e 8-2 2°...e- Val kx) 
0 


Then we change variables from the “relative” 7’s 7; to the “absolute” 7’s 7;, 
-2 
= Te = is Tn-1 = 0 S %#=-So5 

j=i 

and use the T-dependent V’s 

VGi=e™V ie, V(0)=V 
to write 
An = / dN-37 (—ky|Vn—1(0)Vn_2(Tn—2).--Vo(72) |x) 
—00 SF <Fi41<0 
when the initial and final states are on shell, 
Ho|ky=0 & k*+m?=0 


This is the same form that appears in nonrelativistic quantum mechanics (in the 
Heisenberg picture), simply evaluating operators for the potential at arbitrary times, 
where one time is not integrated over because of time translation invariance. (Its 


integral gives the usual 6 function for energy conservation. See subsection VA4.) 


We can also write the initial and final states in terms of the same vertices, for 


arbitrary 7’s: 
V(F)|0) = e7 #°Ve*0(0) = e7 FOV |O)e 7? = ge*Ho|p\e-7/? — |k) ge 7? 
Then the amplitude is 


Ay=g? lim ea (0| Viv (717) Vina (Fv _1)..-V2(72)Vi (7) |0) 


This is the form of the amplitude we might have expected from a first-quantized path 
integral in terms of X(7), with an interaction term — { dr V(r), except that 3 of the 


V’s are not integrated. 
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Now we can easily evaluate the S-matrix element by path integration, since X 
appears everywhere in exponentials. We saw in subsection VIIB5 that multiplication 
of 2 exponentials inside a path integral for a free theory yields their “normal-ordered” 
product times the exponential of a Green function. Since all the vertex operators 
(including those for initial and final states) are exponentials, we can also see this 


result from completing the square in the functional integral. Either way, the result is 
TI : explik, - X (7;)] ; = exp - So ki ky (X (%)X(%)) 
i i<j 
(The normalization is clear from Taylor expansion.) As a consequence of normal 


ordering each vertex, we drop any terms coming from connecting a vertex to itself 


with a Green function. 


We normalize the Green function as 
(X X)=5G 


Then the amplitude is simply 


T1—— 090 


i gh? ee ene pees exp - S- kj 60.2) 


7N—1=0 <j 


Pemnaes Ti STi 41 


We can use the propagator for X(T) 
—1G =§ => Gi7,7r)=-|r-7' 
where we have applied the boundary conditions 
GT )=GClr', 7); Gar) =0 


to avoid “renormalization” for the 2 = 7 terms. We then get 


Ay = 9%? lim ae kG exp E WE, <The | 


T{1—7—- 00 
FN _—1=0 i<j 
TN +00 


TiSTi41 
At this point we could convert back to the original r’s to make the integrals easier 


and arrive essentially at the starting point. 


Now that we understand the approach for trees, we can analyze loops. 1-loop 1PI 
graphs are easy to relate to the tree graphs we have considered: Starting with a tree 


graph with 1 long line out of which branch external lines (but no trees), we connect 
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the 2 ends of the long line with another propagator. (There are fancier arguments, 
but they are less convenient: Unitarity gives only the imaginary part of the loop, and 
requires then a dispersion relation. Feynman’s tree theorem gives a cut propagator, 
plus additional multi-cut graphs that must be argued away.) This is obvious from the 
diagrammatic point of view; the reason we start from a tree is that normally quantum 
mechanics is just matrix mechanics, and is geared toward sandwiching a product of 
matrices (operators) between 2 vectors (states). A loop is then a trace of a product 
of matrices, where the initial and final states have been replaced by a sum over all 
states (trace), and the initial and final “times” have been identified, making “time” 


periodic. 


We thus start with an amplitude of the form 


i. A4 . 
a a a eee V; = elke 
Be EN a) ool rm); 


This is the same expression we used earlier for trees, except for the extra propagator 


1/Hp and the trace, explained above. Again as for trees, we introduce Schwinger 


i] [ ants 
—= gre 7 
Ho 0 


and change variables from the N “relative” r’s, 7;, to the N — 1 “absolute” 7’s, 7;, 


parameters 


and the “overall” 7, T: 


N N 
Tj =T;i — Ti-1, Tn =0 > T= - ) 7, T=-tH= ) Ty 
j>i 1 


and use the 7-dependent V’s 
VA=e*V(O)e*, V(O)=V 
to write 


Ay 


/ dT dN tr[Vx(0)Vn_1(n_1)...Vi (7 )e | 
-—o0<-T<ij<h4 


/ dT dN~'# S~(n, 0|V(0) V1 (Fv-1)...Val)|n, —T) 
-—o0<-T<ii<h4 


11<0 a 
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where we have written the trace as a sum over all states to show that the effect of 
the surviving propagator is to guarantee that there is a length of “time” 7’ between 


the initial and final times, which are sewn. 


The amplitude can now be evaluated by 1D Feynman graphs (by operator or path 


integral methods) as (compare the tree result above) 


AY = / dT dN'F V(T)exp —} So ki ; ac.) 
—o0<—-T<F<Fi41 <0 


t<j 


where the Green function G is the X-propagator for this worldline (see below). 
The “volume element” V(T’) (called the “partition function” in statistical mechanics, 
where T is the “inverse temperature”) comes from the “vacuum” amplitude found 
by evaluating the general amplitude for vanishing sources V = 1. (In the path in- 
tegral approach, it comes from the determinant of the Green function.) It depends 
only on the parameter T that defines the geometry (i.e., the range of “time” for G). 


Comparing our “result” to the original expression, we see 


dp 2 2 2 
_ THe) —T(p?+m?)/2 __ m—D/2,—-Tm?/2 
V(T) =tr (e =| gore ( )/ =T e 

A standard change of variables for loops is to factor out the scale T’ from the 
times: 

T; _ —Ta,; => 
AY = | dN—lq | dT TN VT) exp |-3 ok - kjG(a4, a, T) 
O<aj41<ai<l 0 


where a; are (some linear combination of) the Feynman parameters. 


At this point we need to note that the Green function can’t be defined in the 
strict sense, since we now have closed lines: In a closed space, the total “charge” 
must vanish, by comparing Gauss’ law inside and outside any closed “surface”. This 
is related to the existence of “zero-modes”, functions that are killed by the wave 
operator (d’Alembertian), on which this operator therefore has no inverse. In this 
case the zero-mode is a constant, since a constant is the only periodic function that 
is a homogeneous solution to the wave equation. This zero-mode corresponds to an 
invariance (translations), and must drop out anyway. It appears in general when 


solving the wave equation 
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where the OI kills any zero-modes mistreated by 1/D. More explicitly, this is satisfied 


for any space with coordinates 7 if 


—40G(r7,7') = 6(7 — 7’) + h(7) 


=> -40 f araee 7 alr) = 9(7) +h(r) faite) 


for some h, since the latter term vanishes by momentum conservation, the aforemen- 


tioned invariance: Since the source comes from vertex operators e*'* , 


j= > k6(r -— 7) = fi=De=0 


The Green function G(r, 7’) should be symmetric in 7 and 7’, since only the sym- 
metric part contributes to 7(1/O)7. For the loop the only choice for h is a constant, 
representing a constant “background charge” distribution in addition to the point 
charge represented by the 6 function in G’s wave equation. The value of the constant 


follows from integrating the Green function’s equation over the loop (just 7’): 


1 
length 


(or “length” — “volume” for a general space) so the total charge vanishes. The Green 
function itself is now determined up to a constant. 
We easily modify our earlier tree result for G(7, 7’) = G(r — 7’) to 
-}8 = (r)- 5 => G=-Hl+5 
We have written the result in a form valid for |r| < 7, which is sufficient in terms 
of 7 —7’ for 0 < 7,7’ < T. This function is a repeating parabola, rather than the 


“V”-shape of the tree case. T’ then scales out of G in a simple way: 

G(% — %) = —T lla; — a,| — (a4 — 0%4)"] 
Using the result for the volume element above, the T’ integration is then of the form 
[ dT TN-1-D/2.-TF (aim?) _ r(N _ a aed ama 


0 


for a function F’ found from the above prescription, where the I’ function shows 
the usual divergence structure for [ d?p p~?% (N propagators with no derivatives at 


vertices). All that remains are the usual, messy Feynman parameter integrations. 


Exercise VIIIC5.1 


Use this method to explicitly evaluate the propagator correction. 
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Finally we consider generalization from scalars to vectors. The basic idea is to 
make a “stringy” generalization of the exponent of e**: In terms of some o-dependent 


parameter P(c), 


On 


keeping only the first “excitation” (so we can describe massless vectors), where the 


[ EPO) Xl) + kX + e-OX 


mode expansion is implemented as a Taylor expansion of X about one of the bound- 
aries. The first term gives just the momentum dependence we have already used 
for the scalar vertex while the second term, if expanded in the exponential to lowest 
nontrivial order, gives the vector vertex. (Ghost dependence can be included by the 
same method.) In a more general approach, this corresponds to introducing arbitrary 
external states with a “source” P(o) by writing the wave functional W[X (c)] in terms 


of its functional Fourier transform W[P(c)]. 


Although this derivation was motivated by string theory, this result can be applied 
to particles. In fact, the usual contribution of an external vector to the particle action 


is g [ dr X - A(X), which upon Fourier expansion in X gives vertices 
Vimestor = Oh" X eikicX 


where & is the polarization vector. This is the same result obtained by expanding the 
exponential described above to first order in «, zeroth order reproducing the scalar 
vertex. (The o derivative gets replaced by a7 derivative for X satisfying the 2D wave 


equation: see chapter XI.) 


Evaluation of the amplitude is simpler if we keep the original exponential in both 


k and k, 
= gellkX+e-X) 


Then the only change in the amplitude is the replacement 
k; : kiGy _- k; : kiGa + Ky k,0Gi; + k; KO Gi; + Ky - Kj00 Gi; 


where O is the derivative with respect to 7; and 0" to 7;. Since G depends on them 
only through the difference 7; — 7;, we can write 0’ = —O. (For some applications it 


may be useful to integrate these derivatives by parts in the amplitude.) 


The last term gives a simple expression: 
00G = —0°G = 26(r) 


in terms of 7); = 7; — 7;. Unlike the other h-dependent terms, this piece gives direct 


contraction of vector indices on polarizations, instead of contraction of them with 


C. HIGH ENERGY 583 


momenta. The 6 term makes the 2 vertices corresponding to the 2 polarizations 
coincide: It correpsonds to the A? term in the coupling of an external electromagnetic 
field to this scalar. 


A simple example of this method is to keep the full exponential in «, but choose 
Ky = +k; 


in some units. This can be considered a more “stringy” model with higher-derivative 
couplings. The + will be interpreted as corresponding to the 2 boundaries of the 
string: We have written X — X + 0X as an infinitesimal expansion of X at either 
boundary about the center of an infinitesimal string. We can enforce this interpreta- 
tion with group theory by using the ’t Hooft double-line notation, so the “inside” and 
“outside” vertices couple to different lines, i.e., with different group theory factors. 


The above modification is now 
: 1 
Gig > Gig + (4i — 4j)Gig + (Hit;)2 [bn = 4 


where +; indicates which boundary the vertex is on. After again scaling 7; = —T'a;, 
the first term goes as T again, the second is T-independent, the last goes as 1/T. 
Ignoring the uninteresting 4-point contribution (which results in terms in the ampli- 
tude similar to those for fewer external lines), the most important new contribution 
for small T is the a;-independent 1/T term. It modifies the T integration for small 
T: Using 


> (Fabs hi hy = 3 > Hit shi - hy = 3 [= st 


i<j aj 


2 2 
=3(Sm- Der) =2(o4] = —2s 
Fy r I 
writing i = (J, I’) for the two “sides” of the particle, and choosing the external vector 


to be massless (k? = 0), we find 


i. dT TN-1-D/2¢-28/T = [ dT" T'D/2-N-1,-2T's = re _ Nistor 
0 0 


where 7” = 1/T. Thus, for the interesting case of the 4-point amplitude in 10 dimen- 
sions, we find a massless pole 1/s replacing the usual 1-loop divergence, which now 


appears only for the “planar” case where all vertices are on the same “side”. 


Exercise VITIC5.2 
To what explicit modification of the field action for a scalar does this stringy 


vertex correspond? 
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The above method gives coupling of external vectors to an internal scalar. Gen- 
eralizing the internal particle to a spinor or vector is not as simple classically, but we 
can make the generalization from what we know about such coupling from the field 


theory methods of subsection VIIIA3: Using the universal kinetic operator 
O+igF@ Sap 


with quantum spin operator 


0 for spin 0 
Sab = F700) for spin ; 
| fa) (o)| for spin 1 


we expand to linear order in the vector field: 
—}(O1 + igF Sy) © —40y + glA- (18) + (#8 A?) Sus 


where Lp is the free O1, and for convenience we have ignored ordering by using the 


(background) gauge 0: A = 0. Choosing the external field to have definite momentum, 
A,(z) = «,e"** 


In terms of the momentum p of the internal particle and k of the external field, we 
then have 
y= gxie®? (p, _ k? Sap) 


In Lagrangian language, we simply replace p > X: 


V =gnte**(X, — k’S.5) 


REFERENCES 


1 R.P. Feynman, Phys. Rev. Lett. 23 (1969) 1415; 

J.D. Bjorken and E.A. Paschos, Phys. Rev. 185 (1969) 1975: 
parton model. 

2 J.C. Collins, D.E. Soper, and G. Sterman, Factorization of hard processes in QCD, in 
Perturbative quantum chromodynamics, ed. A.H. Mueller (World-Scientific, 1989) p. 1; 
G. Sterman, Introduction to perturbative QCD, in Perspectives in the standard model, 
proc. TASI ’91, Boulder, Colorado, June 2-28, eds. R.K. Ellis, C.T. Hill, and J.D. 
Lykken (World-Scientific, 1992) p. 475; 

Sterman, loc. cit.; 

G. Sterman, hep-ph/9606312, Partons, factorization and resummation, in QCD & be- 
yond, proc. TASI ’95, Boulder, Colorado, June 4-30, ed. D.E. Soper (World-Scientific, 
1996) p. 327: 

reviews of factorization. 


3 


4 


C. HIGH ENERGY 


G. Sterman and S. Weinberg, Phys. Rev. Lett. 39 (1977) 1436: 
justification for jets from perturbative QCD. 

V.N. Gribov and L.N. Lipatov, Sov. J. Nucl. Phys. 15 (1972) 438, 675; 
Yu.L. Dokshitzer, Sov. Phys. JETP 46 (1977) 641; 

G. Altarelli and G. Parisi, Nucl. Phys. B126 (1977) 298. 


M.B. Green, J.H. Schwarz, and L. Brink, Nucl. Phys. B198 (1982) 474: 


1-loop S-matrices in superfield theory as limits of superstring loops. 
M.T. Grisaru and W. Siegel, Phys. Lett. 110B (1982) 49: 

direct 1-loop S-matrices from supergraphs. 

M.J. Strassler, hep-ph/9205205, Nucl. Phys. B385 (1992) 145: 
first-quantized gauge loops based on string methods. 

K. Lee and W. Siegel, hep-th/0303171, Nucl. Phys. B665 (2003)179: 
loops in stringy modification of field theories. 


585 


586 IX. GENERAL RELATIVITY 


PART THREE: HIGHER SPIN 


Higher-spin (unstable) particles have been observed experimentally. Whether 
they are considered elementary depends on how their theory is formulated. In partic- 
ular, a description of hadrons in terms of strings would have many advantages, such 
as unification of all hadrons, manifestation of duality symmetry, and calculability 
through an accurate perturbation scheme. 


Gravity and supergravity also include higher-spin particles. String theory might 
also yield some solutions to some of their problems, especially renormalizability and 
unification of all particles. Such gravitational strings would differ from hadronic 
strings in their mass scale and in the appearance of massless particles, including the 
graviton itself (in contrast to the massive “pomeron”, the analog of the graviton in 


hadronic strings). Gravitational strings might require supergravity. 


For these and other reasons supergravity and strings are two of the major areas of 
research in theoretical high energy physics today (although not the only ones). Most 
of the discussion of this part is introductory, and can be covered earlier, but it is not 
essential to the course; however, its inclusion in a field theory text is essential at least 


for reference. 


Gravity is uniquely defined as the force carried by a massless spin-2 particle: There 
are no such particles other than the graviton, and there is no massless spin-0 particle. 
Similarly, the photon is the only massless spin-1 particle. (Gluons do not appear 
outside of hadrons.) Thus, gravity and electromagnetism are the only long-range 
forces. But there are massive strongly interacting particles of all spins. Thus, at short 
distances gravity might not be so clearly defined: Hadrons couple to sums of various 
spin-2 fields, weighted by various functions of spin-0 (scalar) fields, and in a way that 
depends on the type of hadron. This means that the “equivalence principle”, which 
basically says to replace the flat-space Minkowski metric with the “curved” metric of 
gravity as a type of minimal coupling, holds only at macroscopic distances. (Similar 
remarks apply to nonminimal coupling, involving adding to the metric some function 
of the curvature tensor, which involves derivatives of the metric, and its covariant 
derivatives, which is possible even for weakly interacting particles.) For these and 
similar reasons, the success of general relativity at macroscopic distances should not 
be taken too seriously when applied to interactions at the submicroscopic scale, as 
in the earliest stages of cosmology (“inflation”) or the latest stages of gravitational 


collapse of stars (“black holes” ). 


A. ACTIONS 587 


__ IX. GENERAL RELATIVITY — 


Before discussing supergravity we need to study ordinary gravity. Both can be 
treated as generalizations of Yang-Mills theory. We use this approach rather than 
the traditional one, based on the metric, which is insufficient for describing spinors or 
supersymmetry: There is no useful definition of distance in anticommuting directions 


in curved (super)space. 


Gravity is the only observed long-range (massless) force mediated by a higher- 
spin (2) field. It is relevant for astrophysics, cosmology, and unification, all of which 


have applications to particles of lower spin. 


pMiiadodteaniebiy ees ie BR TIN ects scrcoesaisctnnsrecaners 


We begin with the general principles that define pure gravity as a nonabelian 


gauge theory, and use them to derive actions and couple to matter. 


1. Gauge invariance 


General relativity can be described by a simple extension of the methods used to 
describe Yang-Mills theory. The first thing to understand is the gauge group. We start 
with coordinate transformations, which are the local generalization of translations, 
since gravity is defined to be the force that couples to energy-momentum in the same 
way that electromagnetism couples to charge. However, these are not enough to define 
spinors. This is easy to see already from the linear part of coordinate transformations: 
Whereas SO(3,1) is the same Lie group as SL(2,C), GL(4) (a Wick rotation of U(4)) 
does not have a corresponding covering group; there is no way to take the square root 
of a vector under coordinate transformations. So we include Lorentz transformations 
as an additional local group. We therefore have a coordinate transformation group, 
which includes translations and the orbital part of Lorentz transformations, and a 


local Lorentz group, which includes the spin part of Lorentz transformations. 


Clearly the coordinates 7” themselves, and therefore their partial derivatives 0,, 
are not affected by the (spin) Lorentz generators. We indicate this by use of “curved” 
vector indices m,n,..... On the other hand, all spinors should be acted on by the 
Lorentz generators, so we give them “flat” indices a, 3, .., and we also have flat vector 
indices a, b, ... for vectors that appear by squaring spinors. Flat indices can be treated 
the same way as in flat space, with metrics Cag and 1, to raise, lower, and contract 
them. 
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Some gravity texts, particularly the more mathematical ones, emphasize the use 
of “index-free notation”. An example of such notation is matrix notation: Matrix 
notation is useful only for objects with two indices or fewer, as we saw in our treatment 
of spinor indices in chapter II. Such mathematical texts consider the use of indices 
as tantamount to specifying a choice of basis; on the contrary, as we have seen in 
previous chapters, indices in covariant equations usually act only (1) as place holders, 
indicating where contractions are made and how to associate tensors on either side of 
equations, and (2) as mnemonics, reminding us of representations and transformation 
properties. Thus, the full content of the equation can be seen at a glance. In contrast, 
many mathematical-style equations (when indeed equal signs are actually used) say 
little more than “A = B”, with the real content of the equation buried in the text of 


preceding paragraphs. 
We therefore define the elements of the group as 
g=e’, A= Nr" On + 5A” Moa 
where 0,,, acts on all coordinates, including the arguments of the real gauge parameters 
A™ and A” and any fields. M,,» = —Myq are the second-quantized Lorentz generators: 


They act on all flat indices, including those on \” and any fields that carry flat 


indices. As a shorthand notation, sometimes we will also write 
5A" Moa = 1 Mz 
(and similarly for other appearances of the antisymmetric index pair ab). We thus 


have a combination of the matrix generators of section IB and the coordinate gener- 


ators of subsection IC2. 


Exercise IXA1.1 
Sometimes it’s more convenient to perform explicit finite coordinate transfor- 
mations in terms of new coordinates as functions of old, as in subsection IC2. 
As an example for curved space we consider the sphere in arbitrary dimen- 
sions. Rather than the usual cumbersome angles, which introduce trigono- 
metric functions into measurements of distances, we use coordinates which 
manifest the slightly smaller rotational invariance of the corresponding flat 


space, as we did for scalar fields in subsection IVA2. 


a As in subsection IVA2, we can derive coordinates for the sphere by constrain- 
ing flat space in Cartesian coordinates to have unit radius. Rather than 
looking for an explicit solution as in subsection IVA2, we can enforce the 


constraint by the replacement 


y 
oS — 
|| 
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so the flat coordinate “vector” x automatically has magnitude |x| = 1 at the 
expense of introducing the scale invariance 


y=XAyy (x =2) 
Show the infinitesimal distance ds is given by 


- aye (qy- da)2 (y [a al \2 
Mage = ve _ (y- dy) - (y dy ) 


Check scale invariance of the last form. 


b Ultimately we’ll need to use the scale invariance to eliminate one coordinate. 
Writing y* = (y°, y’), consider the coordinate transformation 


y= (2 — (27, yf = Dee! 


(This is just a generalization of the substitution used in subsection IVA2.) 
What is the interpretation in two dimensions (y’ = y') in terms of complex 


coordinates? (This generalizes to quaternions in four dimensions.) Show that 


this results in A 
(22!dztl)? 


(P+ OF 
Compare the result on ds? of the scale gauge y° = 1 (on the previous form) 
to that of z° = 1 (on this form). 

Exercise IXA1.2 
More general scale gauges for the previous problem come from considering 


ds? = 


the kind of projections made in map making, looking at the result of shining 
a point light source through a transparent globe onto a plane, where the ray 
from the source through the center of the globe exits it at the point tangent 
to the plane. Instead of looking at the geometry of the rays, we consider 
expanding this globe of unit radius through the plane in such a way that 
the source remains at the same scaled position inside the globe. (The center 
of the sphere moves while the source and plane remain fixed, at least with 
respect to each other.) The globe continues to expand until it intersects a 
chosen point on the plane. Explicitly, in terms of coordinates y of that point 
with respect to the origin of the expanded globe, the distance on the original 
(unit-radius) globe is 2 
\ yl 
while the position of the source is 


a=-a, 2i=0 > y,=(-alyl,0) 
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in terms of the constant a that defines the gauge (projection), so the condition 


that it hasn’t moved relative to the plane is 
yo +aly|=1+a 


a Show the solution is 
g Leer 1 & 1+ b(y*)? ,-i74 
— 2b ~ Te 


Or 


2 = V/14 W(t)? 
b Find ds? in terms of both y’ and z for the special cases 


gnomonic: a=0 (b=1) 
stereographic: a=1 (b=0) 
orthographic: a=co (b=-1l) 


In general, the relation between first- and second-quantized group generators is 
the same as the relation between active and passive transformations, and the relation 
between a matrix representation and the corresponding coordinate representation, 
as discussed in subsection IC1, where in this case the fields are the coordinates. In 
particular, the second-quantized Lorentz operators M,, have the same action as the 
first-quantized Lorentz operators S,, introduced in subsection ITB1: For any field ~, 
M(w| = (|S, etc. The action of the Lorentz generators on vector indices is thus 


given by 
[Mav, Ve] = Viatje => A"[Mr, Va] = $°"[Mea, Va] = Aa’Vi 
This implies the commutation relations 
[Ma, M4] = —6, My" 


In explicit calculations, only two indices in the commutator will match, and they 


reduce to simple expressions such as 
[Mi2, V2] = N22Vi, [Mi2, Mo3] = N22Mi3 


As for derivatives, when acting on functions instead of operators we can write the 


action of the Lorentz generator as simply M,,V. without the commutator. 
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When spinors are involved in four dimensions, it’s simpler to convert all flat 


indices to spinor indices. In that case, we can write 
A= XO + Mz, My = 2% Mig = 4° Mga + 2PM, 


[Mag; by] = Wal yy > MIMy, Wal =. 5x7 [Mya, Worl = Noh We 
[Mag, M”] = 61 Mpy® 


in terms of the SL(2,C) generators Mag = Mga. Note that (Mag)! = +M 45 because 


(M.»)' = —M.». We have used conventions consistent with OSp generators 
s\P° (Mcp, wa} — Na’ vp, AB >= (Nab, Cag, C's5) 


Relating vector to spinor indices as usual as V, = Vag, etc., then fixes the Lorentz 
subgroup of the OSp group as (see exercise IIB7.2a) 


M.. 


oda V1 = Vad CayC gs = V5gCayCay 


= —3(CagVaqC ayy + CapVyaCgy4) = —3(CypMag + CapMys)V74 


= iM 2s 


adh = ~2(Cag Map + CapMga) 


=> d 0d38 — C'x3A08 + Capr55 


For most of the remaining discussion of gravity, we'll limit ourselves to bosonic fields 
in vector notation, which is easy to generalize to arbitrary dimensions. For spinors, 
we must either choose a dimension and use its corresponding spinor notation (for 


D <6), or work in mixed spinor-vector notation (which is much messier). 


Matter representations of the group work similarly to Yang-Mills. We define such 


fields to have only flat indices. Then their transformation law is 
yey 


where the transformation of a general Lorentz representation follows from that for a 
vector (or spinor, if we include them), as defined above. Alternatively, the transforma- 
tion of a vector could be defined with curved indices, being the adjoint representation 


of the coordinate group: 
Ve", = VaVe, Se ve 


However, as in Yang-Mills theory, it is more convenient to identify only the gauge 
field as an operator in the group. In any case, only the adjoint representation (and 


direct products of it) has such a nice operator interpretation. 
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As an example of this algebra, we now work out the commutator of two transfor- 
mations in gory detail: We first recall that the coordinate transformation commutator 
was already worked out in subsection IC2, using the usual quantum mechanical rela- 


tions (see also subsection [A1) 
If, f=(8,0)=0, 10, fl = (Pf) 
for any function f. For the Lorentz algebra we will use the additional identities 
[Mons Om) = [Mais A] = [Moa AM Ma] = 0 


all expressing the fact the Lorentz generators commute with anything lacking free flat 


indices (i.e., Lorentz scalars). The commutator algebra is then 


[AT Omn + 5A0? Moa, A30n + SAS" Mac] 
= NTLOm, N51 On = NT LOm, 5A) Moa + SAG? Moa, AS] Mae 
= (MAT )Om + FALOmAZ + APAIe?) Moa 


One fine point to worry about: We may consider spaces with nontrivial topologies, 
where it is not possible to choose a single coordinate system for the entire space. For 
example, on a sphere spherical coordinates have singularities at the two poles, where 
varying the longitude gives the same point and not a line. (However, the sphere can 
be described by coordinates with only one singular point.) We then either treat such 
points by a limiting procedure, or choose different sets of nonsingular coordinates on 


different regions (“patches”) and join them to cover the space. 


2. Covariant derivatives 


We can also define covariant derivatives in a manner similar to Yang-Mills theory; 
however, since 0,, is now one of the generators, the “O” term can be absorbed into 
the “A” term of V=0O+ A: 


m 1, be 
Va =a On + 5Wa Mop 


in terms of the“vierbein (tetrad)” e," and “Lorentz connection” w,'. Now the action 
of the covariant derivative on matter fields looks even more similar to the gauge 


transformations: e.g., 
do = rN” Om, Va — ea mP 


OV, =r" OmVa + Aa’ Vi, VaVo = €a” OmVs + Wav Ve 
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Oe = A” OnWe oF ra’ We, Vale = Ca” Omg + Wap’ Py 


Le., the covariant derivative V, is essentially D elements (labeled by “a”) of the 


gauge algebra. 
Exercise IX A2.1 


Write the transformation law and covariant derivative of an antisymmetric 


tensor in spinor notation (fag), and compare to vector notation as above. 
Note that the free index on the covariant derivative is flat so that it transforms 
nontrivially under 
V_=¢V.E" 
Explicitly, for an infinitesimal transformation 6V = |A, V] we have 


6a, = (NX Oye, = e,"0, A" A es” 


be = D dae ers ha as (=6,70,\°° 4: hae ee x Natwae”) 


This commutator is the same as for [\;, Az] in the previous subsection, except for the 
two additional terms coming from the Lorentz generators acting on the free index on 
V..- In particular, the vierbein e,™ transforms on its flat index as the vector (defining) 
representation of the local Lorentz group, and on its curved index (and argument) 
as the vector (adjoint) representation of the coordinate group. Also, it should be 
invertible, since originally we had V = 0+ A: We want to be able to separate out 
the flat space part as e,” = 0” +h,” for perturbation theory or weak gravitational 


fields. That means we can use it to convert between curved and flat indices: 
V™=V%e,” & V%=V"™e,,"% 


“ is the inverse of e,”. Furthermore, if we want to define the covariant 


where em 
derivative of an object with curved indices, we can simply flatten its indices, take the 


covariant derivative with V, and then unflatten its indices. 


Flat indices are the natural way to describe tensors: (1) They are the only way 
to describe half-(odd-)integer spin. (2) Even for integer spin, they correspond to the 
way components are actually measured. In fact, the above conversion of vectors from 
curved to flat indices is exactly the one you learned in your freshman physics course! 
The special cases you saw there were curvilinear coordinates (polar or spherical) for 
flat space. Then e,"” was the usual orthonormal basis. Only the notation was different: 
Using Gibbs’ notation for the curved but not the flat indices, V= V“ée,, where, e.g., 
a = (r,0,@) for spherical coordinates and & = (7,0, d) are the usual orthonormal 


basis. Thus, you probably learned about the vierbein years before you ever saw a 
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“metric tensor”. Similarly, when you learned how to integrate over the volume element 
of spherical coordinates, you found it from this basis, and only learned much later (if 
yet) to express it in terms of the square root of the determinant of the metric. (With 
the orthonormal basis, there was no square root to take; the determinant came from 
the cross product.) You also learned how to do this for curved space: Considering 
again the sphere, vectors in the sphere itself can be expressed in terms of just 6 and 
. And the area element of the sphere (the volume element of this smaller space) you 


again found from this basis. 


For example, consider nonrelativistic momentum in two flat spatial dimensions, 
but in polar coordinates: Now using x” to represent just the nonrelativistic spatial 
coordinates, 


= (78), pp” =m— = (mi, mA) =a", 


Then the two components of p* (with the simplest choice of e,’”) are the usual com- 
ponents of momentum in the radial and angular directions. On the other hand, one 
component of p” is still the radial component of the momentum, while the other 
component of p™” is the angular momentum — a useful quantity, but not normally 
considered as a component along with the radial momentum, which doesn’t even have 
the same engineering dimensions. In writing the Hamiltonian, one simply squares p®% 


in the naive way, whereas squaring p™ would require use of the metric. 


Exercise IXA2.2 
Show that the above choice of e,” actually describes flat space: Use the fact 
that p* transforms as a scalar under the coordinate transformations that ex- 
press r and @ in terms of Cartesian coordinates x and y, and as a vector under 
local “Lorentz” transformations, which are in this case just 2D rotations, to 


transform it to the usual Cartesian p’* = (ma, my). 


This direct conversion between curved and flat indices also leads directly to the 
covariant generalization of length: In terms of momentum (as would appear in the 


action for the classical mechanics of the particle), 


ay” 
o' = ma —m =p? =p*p’ngs => —ds? = dx™dxem en na = dz™ dx" Gmn 
S 


Equivalently, the metric tensor gm, is just the conversion of the flat-space metric Nap 


to curved indices. Also, in terms of differential forms, 


adn "e" = =a S'S 
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These curved generalizations of the energy-momentum relation and definition of 
proper time imply the corresponding generalization of the definitions of timelike, 


lightlike, and spacelike. 
The field strengths are also defined as in Yang-Mills: 


Vas Vo — ta Ve on 5 Rav’ Mac 


where we have expanded the field strengths over V and M rather than 0 and M so 


that the “torsion” T’ and “curvature” R are manifestly covariant: 
M=0Me*=M = T'=eTe*, R' =e'Re* 


The commutator can be evaluated as before, with the same change as for going from 
[A1, Ag] to [A, Val (i-e., now there are two free indices on which the Lorentz generators 
can act), except that now we rearrange terms to convert Om — €q'"Om — Va. Making 
the further definitions 


€a = ea Os leas ep i Cai: Ca => Cae = (Gees) Jem = =, €h Omen 
for the “structure functions” c,»°, we find the explicit expressions 
d 
Tae = Cap + Whab]® = a ee a Elm Wnjd’) 


cd cd e,, cd ce d mn cd ce d 
Rab = Cfab] — Cab We + Wia Whe = €a Cb (OfmWn] + Wim Wnje ) 


If we ignore the action of V on curved indices (it doesn’t act on them, but alternatively 


we could flatten them, act, then curve them back), we can also write 


se _ = Vine [Vis Val = 5Rimn” Moa 


where 
a 1 ab 


is essentially a covariant derivative for the Lorentz group only. 


From this expression for the torsion we find the following expressions for the curl 


and divergence of a vector in terms of curved indices: Defining 
e = det e,” 


=> cha’ = (ep™Omea”)en’ = (es Omea” en” — ea [(Omes” )en”] = Omea™ — ea Omin © 


we have 
Ga Ge OV = €ja(€y)"" Vn) = Ga Ge” Vay = V [aVo] _ Ta Vo 
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one VV" = 6." OnV tig VS Va ea VO 


Exercise IX A2.3 
Relate the two above identities by comparing (in D=4) VjaVica (generalizing 
VaVe)) and VaV" for Vabe = €abeaV *: 


In practice, a useful way to evaluate the commutator is to first evaluate the com- 
mutators of the Lorentz generators with the whole covariant derivative, and then 
subtract out the double-counted [M, M] term. This is particularly convenient when 
considering some explicit solution to the field equations with a reduced set of com- 
ponents (e.g., spherically symmetric), so that explicit indices may be lost except on 


the Lorentz generators. Schematically, we then calculate 


[Vi, V2 = [ey + WwW, €2 + Wo] 


= {le1, e9| + (€1W2)Mo = (€2w1)M;} + {wi[M,, V2] _ we[Mo, Vi _ wywe|M1, Mo)} 


This method turns out to be one of the two simplest ways to calculate explicit solutions 
(as opposed to discussing general properties). (For examples, see subsection IXC5 
below.) 


The covariant derivative satisfies the Bianchi (Jacobi) identities 
0 = [Vja; [Vo, Val] = [Vjas Teey?Va + 5Roe]*” Meal 


= (VjaTbe*) Va + §(V [a Req”) Mea — That\° (Tela? V5 + 5 Rela? Mop) — Riase*Va 
> Risa = Vila? _ Tear la V aoe _ Thar}? Repje\” =0 


To make the transformation laws manifestly covariant we can define instead 
N= Va + 5A" Moa 


which is just a redefinition of the gauge parameters. The infinitesimal transformation 


law of the covariant derivative is then 


5Va = [(6ea™)em Vo + (Ca Sum”) Mey = [°V, + 4A Mos, Val 
= (—Var? + AT ea? + Aa’) V5 + 3(—Var” + A" Raa”) Mes 


=> (Sea™)em® = Vad? + AT ea? + Aa?, ea’ ™Swnh? = —Var + 7 Raa” 


Exercise IXA2.4 


Show that a finite local Lorentz transformation takes the form 


Vi = Aa’ [Vs — 3(VeA%) A%¢ Mac] 
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in the case of vanishing torsion by starting with the more general expression 
v. = A.V» i 5A Mop 


examining the commutator [V’, V’], and applying T’ = 0 to determine A. 


A “Killing vector” is a transformation that leaves the covariant derivative invari- 
ant: 


Val =0; K = K°V,+ 35K? Mia 


(The term is usually used to refer to just the general coordinate part K™ of the 
transformation, but we’ll use it in a generalized sense to refer to the complete K.) 
It represents a symmetry; the existence of Killing vectors depends on the particular 
space described by the covariant derivative. It then follows from the Jacobi identity 
for [Ay, [Ko, V]] that the Killing vectors form a group, the symmetry group of that 


space. Invariance of the covariant derivative requires: 
—ViKO+K Tq? + Ka? =0 => V (a/Xb) POT slabs Ka = sViaky =, 5K °T eat] 


=V kK as K* Ry, =i) = Vik" = Ken. 


These equations are referred to as the “Killing equations”. (Again, usually it is just 
the first equation, on K,, that is called by this name, but we’ll use it to refer also 
to the equations for Kay, which are needed to describe the symmetry when acting on 
spinors, etc.) 
Exercise IXA2.5 
Express the Hamiltonian of the classical relativistic particle in terms of the 
vierbein: 


oF ee = 5(’eaep eat m?) 


Doing the same for general coordinate transformations K = K“%e,, examine 
the condition for invariance [K, H] = 0 using the Poisson bracket. Using the 
commutation relations for the e,’s, show that this implies the Killing equation 
V (ako) = KoT (ab): 

Exercise IXA2.6 
Solve the Killing equations explicitly in the case of flat space V, = 0,. Show 


this gives the Poincaré group, including both orbital and spin pieces. 
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3. Conditions 


There are two kinds of conditions we can impose to eliminate some degrees of 
freedom: gauge choices and constraints. Gauge choices explicitly determine degrees 
of freedom that drop out of the action anyway. If the gauge is not completely fixed, 
the form of the residual gauge transformations may change, since using particular 
gauge parameters to fix the gauge, rather than eliminating those parameters, may 
just determine them in terms of the remaining parameters: We require that the 
residual transformations do not violate the gauge conditions that have already been 
applied. Similar remarks apply to global symmetries: If they do not commute with 
the gauge transformations for the gauge that was fixed, then they may aquire extra 
gauge-transformation terms to preserve the gauge choice. On the other hand, con- 
straints are chosen to be covariant under the transformation laws, and thus do not 
alter them, while eliminating degrees of freedom that might otherwise appear in the 
action (although not in all possible terms). Furthermore, the simplest explicit solu- 
tion to constraints can itself introduce new gauge invariances. (An example of this 
situation is supersymmetric Yang-Mills: see subsections [VC3-4.) In this subsection 
this analysis will be applied to Lorentz invariance: We already saw that global Lorentz 
transformations are included in coordinate transformations, and that local Lorentz 
invariance is unnecessary when only integer spin (and in particular, pure gravity) is 
treated. We now examine the consequences of eliminating this useful but redundant 


invariance and the gauge field associated with it. 


Of course, we can eliminate local Lorentz transformations by hiding flat indices: 


For the vierbein itself, we have the local Lorentz invariant 


gt™ = neq™eg” 
which is the inverse “metric tensor”. However, we have seen that tensors with flat 
indices have simpler coordinate transformations, and there is no way to get rid of flat 
indices when spinors are involved. Furthermore, the metric has the constraint that it 
have Minkowski signature: This constraint is solved by expressing the metric in terms 
of the flat-space Minkowsi metric 7 and the vierbein. Thus, solving the constraint 
introduces local Lorentz invariance. (However, in this case the constraint does not 


eliminate degrees of freedom, but only limits their range.) 


The Lorentz transformations in \®? are redundant to those in \”. The extra 
gauge parameters also can be fixed by an appropriate gauge choice: For example, 


consider the gauge 


de,” = rg’ep” = Lorentz gauge Nats” = 0 
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A coordinate transformation takes us to a different Lorenz gauge, since the Lorenz 
gauge condition is not a scalar. This means that any coordinate transformation \” 
must be accompanied by a Lorentz transformation \” to preserve this gauge, where 
this A®’ is completely determined in terms of \”. This is easy to see perturbing e,’” 


about 0’": To lowest order we have simply 
0= b(Mm|aev}””) ~y —2Xad + Oars] => \ab ~ SOaA8] 


Exercise IXA3.1 


Let’s further analyze this gauge condition: 


a By looking at the transformation of a vector, identify the specific terms in 
the Taylor expansions of \" and A“ whose coefficients can be identified with 


global Lorentz transformations, in the approximation used above. 


b Using the same methods as exercise [VC4.3, and writing in matrix notation 
e,’” = (e"),™ for some matrix h, solve explicitly for A? in terms of \’" and 


€,”” to all orders. 


Similarly, the Lorentz connection w,” that gauges the \” transformations is 
redundant to the vierbein that gauges \": w can be completely determined in terms 
of e by constraining the torsion to vanish. To see this, we first notice that in the 
general case the expression for the torsion in terms of the structure functions and 
connection can be inverted to give the connection in terms of the other two. One 
way to do this is to use the definition and permute the indices a > b — c (odd 
permutations are redundant because of the antisymmetry of the equation in the first 


two indices): 
Tobe = Cabe + Wabe — Wha, Dad = Coca + Wbea — Webas Tos = Ccab + Weab — Wach 


Using the antisymmetry of the connection in its last two indices, we add the first and 


last equation and subtract the second to obtain 


Wabe = 5 (Chea = Calta) Cote = Cate — Lobe 
Since the torsion is a covariant tensor, we can freely set it to vanish without affecting 
the transformation laws of the remaining objects (it’s a covariant constraint, not a 
gauge condition): 


1 
Top =0 > Wate = 5 (Coca = Calpe} 


From now on we assume this constraint is satisfied. This simplifies the form of curls 


and divergences, which implies that V can be integrated by parts in covariant actions 
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(see below). However, we have already seen that the torsion is nonvanishing in su- 
perspace (subsection IVC3): In that case the symmetry on flat indices is constrained, 
so the connection has fewer components than the torsion, and can be determined by 


setting only part of the torsion to vanish. (See subsection XA1 below.) 


Exercise IXA3.2 
Show explicitly that when the torsion vanishes the Killing equations from 


dé,” = 0 imply those from dw,” = 0: 
Vej=0 Ae=iVieg = VAP =e,” 


Exercise IXA3.3 
Consider using the group GL(D) on the flat indices instead of SO(D—1,1). 
(This construction is not useful for fermions.) Compensate for the extra gauge 
invariance by replacing the Minkowski metric 7,, with a “flat”-index metric 


Jap (and its inverse g®’) that is coordinate dependent, but covariantly constant: 
A= Om + Na’ Ge"; Na [Go", V. = ig Vis Na [Go", V4 = =r A,” 


Ve =€,+ wes Ge’, ke Vo = 1 Ve aa Rare?Ga® 


g nm s,s", i ii = ‘ij 


where now there is no (anti)symmetry associated with the indices on G,° (or 
Ae”, etc.). As a result, gap transforms nontrivially under both coordinate and 


GL(D) transformations. Use it (in place of 7) to raise and lower flat indices. 
a Find the explicit expressions for the torsion and curvature in terms of the 
vierbein and connection. Solve these, and Vg = 0, for the connection in 
terms of the torsion and vierbein as 
Wabe = $(Cbea _ Calbe]) is 5(€cGab _ €(a9b)c) Cabe = Cabe — Tate 


Show that there exists a GL(D) gauge 


Jab = Mab 


(assuming gap has the right signature), that gauge has as a residual flat-index 
invariance SO(D—1,1), and the resulting covariant derivative is identical to 


that used earlier in this subsection. 


b Show that one can instead choose a GL(D) gauge 
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and that this completely fixes the GL(D) invariance. Since the vierbein has 
a curved index, the covariant derivatives are no longer covariant: Unlike the 
previous gauge, to maintain this gauge any coordinate transformation must 
be accompanied by a GL(D) transformation whose parameter is determined 
by the coordinate transformation parameter. Find the solution for \,? in 
terms of A’ in the infinitesimal case. Compare with the transformation law 
for curved indices (see subsection IC2). In this gauge the connection is known 
as the “Christoffel symbols” . 


The vanishing of the torsion simplifies the Bianchi identities on the curvature: 
Cs =0 > Riabeja = V [aPocjde =0 > Fiagea a Teas 


In terms of SU(N)-like Young tableaux, this means the curvature is of the form HH. 


For SO(N) Young tableaux, we subtract out the trace pieces: 
Ravea ~ FH ee Mee 


where the first term is the “Weyl tensor” Wanca (traceless), the last two terms combine 
to give the “Ricci tensor” Ra, = Ra%bc, and the last (singlet) term is the “Ricci scalar” 
R= R*, = R®». They’re simpler in spinor notation in D = 4: Since [ab] > (a) 
and (a8), 

Rabved > Rag) (48) © (R(as)(78) = Weasys) + Cla(yCoyay) 


in terms of Weyl W(agys), the traceless part of Ricci Rap) and the Ricci scalar 


ap)? 
R. Later we’ll see that the Ricci tensor is fixed exactly by the equations of motion. 
That leaves the Weyl tensor as the on-shell field strength. As explained in subsection 


IIB7, it describes helicity +2. 
Exercise IXA3.4 
Prove that Rabea = Redap follows from the Bianchi identity Rjq,qq = 0 and the 


antisymmetry of Rebcq in both ab and cd. 


4. Integration 


The antihermitian form of the group generators was a convenient choice because 
partial derivatives are antihermitian, and the generators of the Lorentz group (which 
is real and orthogonal) are antisymmetric in the vector representation. Thus, the 
generators are real. However, the group elements are not unitary, since hermitian 


conjugation reorders A with respect to O,,. The fix comes from noticing that 


e=dete,” => dlne=e,,"be,” = X"0,|n e — O,rA” 
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=> 6(e')= aoe eNO, = je ySe%e 
where the derivatives Fy act on everything to the left, \ now includes just coordinate 
transformations, and we have exponentiated by the same method as for Lie groups 
in subsection IA3. (Note that if we expand the exponential in a Taylor series such 
derivatives in all but the first factor will hit \’s, just as for those in e* acting to the 
right.) Any function that transforms in this way is known as a “density” (see subsec- 
tion IIIB1 for the 1D case). We can easily see from the infinitesimal transformation 


that a density times any scalar is also a density. This allows invariant actions to be 


S= pu e'L 


for any scalar L. For cases without spinors we can also use 


constructed as 


G=lt Gane" =. eta vaG 


where gmn is the inverse of g’”. (In spaces of general signature, i.e., arbitrary numbers 
of time dimensions, we should write ,/|g| so, e.g., in Euclidean space we actually use 
/g. If we were even more general, and used |det | # 1, then it would also appear.) 


This can also be understood in terms of differential forms, since 


a m a 4 m n 0, 1, 2,3 _ 74 —1 
2° = dx em, =. PSH de ar dd ee, Cn Gy ee =e ee 


a =a): (/ a1) = fo 


under coordinate transformations. 
Exercise IX A4.1 
Let’s look at some properties of transformations acting backwards: 
a Show that for any function f 


M=D =I] = ef=efe*=e->fe* 


and use it to show that the product of e7! 


1 


with any scalar transforms the 


same way as e— 


b Derive 


(i.e., is a density) under a finite coordinate transformation. 


fe = (1 : 3) (e*f) 


(where the derivatives in each factor of \ act on everything to the left, but 


vanish on “1”), 
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Exercise [IX A4.2 
We now examine finite transformations in terms of transformed coordinates 


(see subsection IC2): 


b Show that 


by evaluating 


pu e(¢)= pu e*(a’), dx’ = dx det (=) 
ey 


c Show that 


by similarly evaluating 


pu Figs fe wise fu ela) 


From the results of subsection IXA2, we then have that covariant derivatives can 


be integrated by parts in such actions, since 
G9, = i dz eV ,V* = / dr Bye V™ 


Exercise IX A4.3 
Derive the expression for the covariant divergence in terms of e and the partial 


divergence by assuming integration by parts: 
[etev.v = - f ervev.¢ 


Use this to find a simple form for the covariant d’Alembertian on a scalar: 


H 

Lie = Vo = [=U mnV oe ee nP 
Vg 

Actions for matter are constructed in a similar way to Yang-Mills: Starting with 

the flat-space action, replace ordinary derivatives with covariant derivatives. The new 


ingredient is the extra factor of e7!. 


This prescription, as for Yang-Mills, is unam- 
biguous up to only field-strength (curvature) terms, which can usually be eliminated 
by symmetry requirements and dimensional analysis. (At least for low energies, we 
want terms of the lowest mass dimension.) This uniqueness (at low energies or long 
distances) is known as the “equivalence principle”: Inertial “mass” (really energy, 
but also momentum), as determined by the kinetic term, is the same as gravitational 


mass, as determined by the coupling of the gravitational field. 
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A simple example of matter is a real scalar field: 
S= fet +m’ x? + aRx’] 


The constant a can sometimes be fixed by symmetry: In the massless case, to preserve 
the global symmetry dy = €, we must have a = 0. (For self-interacting scalars, this 
generalizes to a global nonabelian symmetry.) To preserve conformal symmetry (see 


1 D-2 
4D-1'° 


subsection IXA7), also for the massless case, we need a = 
This form of actions in terms of scalar Lagrangians also suggests we modify the 
definition of functional variation for convenience and covariance: 
6S 
6S = ew "bz, 
or, equivalently, we use the covariant form of the 6 function, 
d(x) 


Téa’) e(x)d(a — 2’) 


As in flat space, the action for electromagnetism follows from gauge invariance: 
Sao) e ia | J=99"" Gg Fp Fng 


where Finn = OfmAnj. Integration by parts then gives a simple form for Maxwell’s 
equations. Such simple covariant equations of motion that don’t require explicit 
expressions for the Lorentz connection appear only for antisymmetric tensors (which 


in practice means just spin 0 and 1 in 4D). 


Exercise IX A4.4 


Methods related to differential forms can be applied to these special cases: 


a Rewrite the above action for electromagnetism in terms of A, and covariant 
derivatives. Find the field equations following from both forms of the action, 
and use this to find a simple expression for the covariant divergence of an 
antisymmetric tensor with curved indices using just the metric. Compare the 


results of the previous exercise. 


b By converting flat indices on the covariant tensor €gpcq to curved, show that 
a d 1 mnpq 
—GJEmnpqg AN =a 


are also covariant tensors. Use these, and the covariance of the curl (see 
subsection IC2), to arrive at the same expression for the covariant divergence 


of an antisymmetric tensor. 
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Another example is a Dirac spinor: 
= -liy/~a; m 
s= fe Wy iVa t+ T5)W 


where y* are the usual constant Dirac matrices, in terms of which the spin operator 
appearing in V is the usual —M,, — Sa = FV 0 Vs}: In 4D, we can rewrite this in 
spinor notation by simply replacing 0, a Vu 3 in the flat-space expressions given in 
subsection II[A4, and replacing Ma, — Mag as described in subsection IXA1, as well 
as [d*z > fd*ret. 


5. Gravity 


The Einstein-Hilbert action for gravity follows from choosing the only available 


scalar second-order in derivatives, the Ricci scalar: 


Lg = —}R= —} Res" 


L 
4 
This action normally has a coefficient of 1/K? (compare Yang-Mills), but we’ll gener- 
ally use (natural/Planck) units « = 1; then « is used only to parametrize expansion 
about the vacuum and define the weak-field limit. (Actually, Planck units normally 
use G = 1, whereas in our conventions k = 1 — G = 7.) In any case, the «’s can 
always be absorbed (unlike Yang-Mills) by a field redefinition of e,”", and then appear 
only in the definition of the “vacuum” (perturbative ground state, or solution that 


defines the boundary conditions at infinity): 


(o") = K2/(D-2) 6 m 


This makes e,""0,,, and thus dx’e,,“ and ds”, dimensionless. In this sense, gravity is 
a theory with “spontaneous breakdown” of conformal invariance: Coordinate trans- 
formations include conformal transformations, but this invariance is broken by the 


vacuum, which introduces a length scale («). 


Exercise IXA5.1 
Consider the covariant derivative for nonvanishing torsion. By solving for the 
Lorentz connection in terms of the structure functions and torsion, express the 
covariant derivative in terms of the torsion-free covariant derivative Vv and the 
torsion. Thus, any action in terms of V can be rewritten in terms of Vv and T, 
so any theory with a nonvanishing torsion is equivalent to a similar one with 
vanishing torsion (assuming the action is only second-order in derivatives of 


the vierbein, and thus algebraic in the torsion). Take the commutator of two 
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V’s to find the curvature in terms of the torsion-free curvature Piss Write 
fo} Oo 
the Einstein-Hilbert action with nonvanishing torsion in terms of R, V, and 
T, to find: 
R= R= (Ty P= FT Tee FAT Te — VT. 


Since the last term vanishes upon integration, T appears as an auxiliary field, 
so R is equivalent to just R. 
Exercise IXA5.2 
For some general applications, where the form of the vierbein is not specified, 
it is useful to have a more explicit expression for the action in terms of the 
vierbein. We found in subsection IXA2 that for vanishing torsion 
Wav" — ed,(e"'e,"°V") — enV" ~~ oe v* 
Use this to show 
R= (cap’)? + 40 Chea — be Cate — 2 Ome” On (Cae *)] 
We can drop the last term in the action integral under appropriate boundary 
conditions. (Hint: Use the result of the previous exercise for w = 0.) 
Exercise IXA5.3 


In two dimensions there is a single Lorentz generator, 


Ma =€aM => Va=eatwaM,  [Va, Vi] =—}eaRM 


a Show that the connection and the only surviving part of the curvature then 
take the simple forms 


Wa = —€qpeOme te’, R= —2ed,,[e?"d,(e1e,")] = —2e71 7 e,€ 


b Derive, for the sphere in spherical coordinates, 


~ vi @& % 
LY sin 6 7 


(Hint: First use ds? = dx™dx" gmp in 3D flat space.) Then show the covariant 


derivative is 


: = 1 
Vo = Oo; Vo= sin 0. 


c Use these results to calculate [dz e~'R for the sphere in two ways: (1) by 
showing R is a constant and pulling it out of the integral, and (2) by converting 
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it into a boundary term, where the “boundary” consists of infinitesimal circles 
around the coordinate singularities at the poles. (In general, even for spaces 


without true boundaries, one has to treat the boundaries of patches as such.) 


It’s also possible to add a “cosmological term” to the gravitational action: 
See A [ ae e! 


with the “cosmological constant” A. This term has no derivatives, and is thus anal- 
ogous to a mass term. However, it only contributes to the nonpropagating spin-0 
mode of the vierbein (see later), so it doesn’t give a physical mass, but does modify 


the vacuum. 


Exercise IXA5.4 
Show that the action for gravity can be made polynomial in e,™ by a field 


redefinition (rescaling) of the form 


when k takes the values 


n+l 
k= ——— es oe ee 
3" n 7p oe 


and that the resulting action is order Dn + 2 in the field. In what cases (of n 


and D) is the cosmological term also polynomial? 


The variation of the curvature can be obtained directly by varying its definition 
in terms of [V,V]. We start with the definition 


bea” =Ca’es™ > Ca’ = (Sea Jem’ 


and work in terms of the flattened object ¢,». Then we drop its Lorentz piece, choosing 
Cab = Cha. We find: 
6Va = Ca’Vi + 3M 


=> 5 (6 Rar) Mae = [Via, 5Val] = (Via6s}9) Ve — Ca 3 Rojo” Mea + $V faGny Mac + Cfat]°Ve 
=> Vado + Caaj° = 9, bRe = Vey” = Gage 
=> Cate = VieScla 
+ ~hy? = 3{Via; Via" = (Cin Feue + po“) 
de |= -e 16 Ine = —e 1em%5e,”™ = —e 10%, 


=> 6(e7*R) = 2e'[(n"O — V°V") + (RY — 3 R) Cab 
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where 0 = V“V,. Thus for pure gravity we have the field equations 
6Sqg=0 > Rav — 4Nwk = 0 > Ray = R=0 
while with a cosmological constant we have 


6SGt+Seos=0 => Ray—gnw(R-4A)=0 > Rw-FZnwR=0, R=4,2 A 


Note that calculating a variation is the same as performing a perturbation to 


lowest order: We will use this result in subsection IXB1. 


Exercise IXA5.5 
For gravity, a first-order formalism follows from not imposing the torsion con- 
straint (see exercise IXA5.1), so either the torsion or the Lorentz connection 


can be treated as the auxiliary variable. 


a Find a first-order action for gravity (in all D) by treating e,,7 and w,” as the 
independent variables. In D=4, using ¢«’’""”!, write this action as polynomial 


in these variables, eliminating the explicit e, to obtain 
a 1 mn a, b cd 
Se = fe a POG ition Cin, Tose 


with R,,°? in terms of just w. 


b Vary this action with respect to both e and w (independently) to find the 
field equations, expressed in terms of torsion and curvature, using 6[Vim, Vn| 
to find the variation of Ryq°? (see subsection IXA2). 


Exercise IXA5.6 
As discussed in subsection IIIC4 for Yang-Mills, in four dimensions we can 
write a complex first-order action for gravity that yields the usual gravity 
action up to a surface term. For Yang-Mills, the complex action was obtained 
by starting with a normal first-order formalism and replacing the auxiliary 
field with its self-dual part. 


a Starting with the first-order action of the previous problem, find the analog for 


gravity by keeping just the part of w,,” self-dual in ab, using spinor notation. 


b Associate the coupling « with the term quadratic in w (analogously to the 
Yang-Mills case). As for Yang-Mills, associate the self-dual theory with the 
limit « — 0. Find the equation for e,,% that follows from varying w in this 
case, and show that it is equivalent to setting the self-dual part of w,, to zero, 
where w is the usual torsion-free connection. Show this is equivalent to setting 


the self-dual part of the curvature R,»,,@ to vanish, in an appropriate gauge. 
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(Technically, this means we must either complexify the fields, or Wick rotate 


to 4+0 or 2+2 space+time dimensions, where the Lorentz group factorizes.) 


6. Energy-momentum 


In subsection IIIB4 we saw that in the same way as a current in electrodynam- 
ics or Yang-Mills is defined as the matter contribution to the gauge field’s equation 
of motion, 6S),/dA, = J® (in that case Sj, excludes only the pure Yang-Mills ac- 
tion), the “energy-momentum tensor” is defined as the matter contribution to the 


gravitational field equation (in this case Sy excludes only the pure gravity action): 


0S = fe CLS |v g(ogn” i = -4/ Vv G(O9mn)T ae 


The case where ¢,, represents the invariances of the action implies restrictions on this 
tensor: Using the separate gauge invariance of the matter action dgaugeSam = 0 and 
the matter field equations 6S),/d(matter) = 0 (as for the Yang-Mills case), gauge 


variation of the gravity fields in Sj, implies 


os Nob =—Ato = Tl =0: Lorentz 
ad —3V ars) => V,l%=0: coordinate 


so coordinate invariance of the action implies local conservation of energy-momentum. 


For example, for a real scalar field: 
S= fe "(Vx)? +m?x? + aRx?| 


=> 2Ty = (Vax)(Vox) — 3navl(Vx)? + m?x7] + af(nasO — VaVs) + (Rab — $navk)|x? 


Notice that for a 4 0, the energy-momentum tensor gets extra total-derivative terms 
which are separately conserved in flat space (since they come from the Ry? term, 


which is separately covariant). 


Exercise IXA6.1 
Show that for a = + (using the scalar’s free field equation) one obtains a result 
in agreement with that at the end of subsection IIIA4 in flat space. This is the 
simplest form of the energy-momentum tensor, and the most physical (since 
it involves only the relative momentum of the two fields, not the total). This 


choice for a is also favored by string theory, as we’ll see later. 


Exercise IX A6.2 


Using the action given in subsection IX A4 and the variation of the covariant 
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derivative from subsection IXA5, find the energy-momentum tensor for the 


Dirac spinor, and use its field equations to show this tensor is conserved. 


Note that this is not the same as ordinary conservation 0,7” = 0: { /—gT 
does not define a conserved total energy-momentum. This is in contrast with the 
conserved current in electrodynamics, since we then can derive the usual global con- 


servation law 
c= jer eV." = jer 0,e 1 J™ ~ & farts ee 


On the other hand, it’s closely related to Yang-Mills, where 6A, = —V,A leads to 
V.,J* = 0 in terms of the derivative V covariantized with respect to the Yang-Mills 
field (as well as gravity, if in curved space), so 0,,e7'J™ = —e7'[iAm, J™] 4 0 (see 
subsection IIIC1). 


However, if there is a Killing vector K,, then the component of momentum in 


that direction is conserved: 
Lair. = VP a(R ahaa =U 


(Remember V(,K;) = 0.) Some simple examples of this in flat space are (K,)’ = 6° 
(translational invariance), for which the corresponding “charge” is the total momen- 
tum, and (K,)’* = Pre (Lorentz invariance), for which the charge is the total angular 


momentum. 


Including the variation of the gravitational action, we get the gravitational field 
equations 
Lie = snap = aly 


Coordinate invariance of Sg implies V,(R® — $n%R) = 0, which also follows from 
the Bianchi identities: In that sense gauge invariance is said to be “dual” to Bianchi 
identities, one implying the other through variation of the action: In general, for any 


gauge field ¢@ with gauge parameter \ 


6S 6S 
6¢= OA 0=6S = | dzr(O\)— © OFT—=0 
= 02, fe one a 
where the “transpose” O7 is defined by integration by parts. Positivity of the energy 
(contained in any infinitesimal volume) is the condition T° > 0. The addition of 
the cosmological term modifies the left-hand side of the above equation of motion by 


adding a term 27,pA. 


Although there is no covariant definition of total energy-momentum, in the case 


where spacetime is asymptotically flat (the metric falls off to the flat metric sufficiently 
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fast at infinity), one can define a noncovariant energy-momentum tensor t,, for gravity 
itself which is covariant with respect to coordinate transformations that themselves 
fall off at infinity. (See exercise IIIC1.2 for the analogous Yang-Mills case.) This 
tensor satisfies 0,,(77" +t") = 0 (where T’”” is the usual tensor for matter), so the 
usual conservation laws can be derived for the total energy-momentum coming from 
integrating 7+ t. Many equivalent expressions exist for t. One way to derive it is to 


expand the field equations order-by-order in h as 
pitaer = 5Narl) = Ly — ta 


where Lz,» is the linearized part of the field equations (see subsection IXB1) and —t,y 


is the quadratic and higher-order parts. By the linearized Bianchi identities, we know 
j= O)s bad = On(¢R” = in’ R as ad =, One a2 oy 


where we used the field equations in the last step. Note that there is a great deal 
of ambiguity here: We could have linearized by expanding the metric around its flat 
space value instead of the vierbein, or by expanding R,,, or R™” instead of Ray, etc. 
Because of the expression in terms of L,, ~ OOh, the integral of T +t, which gives the 
total energy-momentum vector, can be expressed as a surface term, just as Gauss’ 
law in electrodynamics. Since space was assumed to be asymptotically flat, only the 
quadratic part of t contributes in the surface integral, which is why there is so much 
freedom in the definition of t. Since t is not covariant, the energy-momentum of 
the gravitational field is not localized (coordinate transformations shift it around). 
However, since the total energy-momentum is invariant, one can ask questions about 


how much energy is radiated to infinity, etc. 


7. Weyl scale 


The simplest way to describe conformal transformations in field theory is as a 
local scale transformation. If the theory is not coupled to gravity, we couple it to 
gravity as in Yang-Mills theory by replacing a Poincaré invariant Lagrangian L(0, W) 
with L(V,w) (where all fields 7 have flat indices), but also including the e~! factor 


in the action. We then transform the fields as 


m 


ea” > Beg™, p+ GFA, 


where @ is the gauge parameter and w + pe is the engineering dimension (scale 


weight) of the field w. (See subsection IIB1.) Effectively, e,’” has dimension 1, since 


it’s the only field with curved indices, and thus any derivative must appear in the 
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combination €,'0,,, while the measure appears as dx e7!. 


Of course, the action 
won't be locally scale invariant unless it is globally scale invariant, i.e., has only 


dimensionless coupling constants (and thus no masses). 


If the gravity-coupled theory is invariant under this local scale transformation, 
then the theory will be conformally invariant after decoupling gravity. This follows 
from the fact that the most general combined coordinate and local scale and Lorentz 
transformation that preserves the flat-space vierbein e,” = 0/” is exactly a conformal 
transformation. This is equivalent to our previous definition in terms of the scal- 
ing of the flat-space ds? under conformal transformations, since dz’dz'"g'_(2') = 


dz™ dx" Gmn(x) under coordinate transformations. 


Exercise IXA7.1 
Derive the usual conformal transformations by finding the most general local 
scale + Lorentz + coordinate transformation that preserves the flat-space 


vierbein. 


A simple example is Yang-Mills theory. We look at the Yang-Mills field with 
curved index, since its gauge transformation does not depend on the vierbein. (6Am = 
—OmA+... v8. 0Ag = —€g™OmA+....) To avoid interference with the Yang-Mills gauge 
transformation, the Yang-Mills field with curved index must be scale invariant. Then 
the action 


= <r ete ee Pent Finn ling 


transforms with a factor 6*~”, and so is invariant in D = 4 only. 


Exercise IXA7.2 
Consider a more general gauge field A and field strength F’ defined by 


0Am,--my — — ar Ohm Ama--my)) Fing-my 4a = HOt Amami] 


where A is totally antisymmetric in its N indices. (Such theories were en- 
countered in exercise I[B2.1b.) 

a Define an action in terms of F?. In what dimension D(J) is it conformally 
invariant? 

b Show that this theory is related by a “duality transformation” (switching 
Bianchi identities and field equations) to the theory with N’ indices on a new 
A, where N’ = D—2-—N, and D(N') = D(N). 

c Examine the cases N = D,D —1,D-— 2. Note that the scalar obtained by 


duality does not have an Rd? term in its action, and thus is conformal only 
i. 2) = 2, 
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Gravity is not scale invariant, but it will prove useful to examine its scale breaking 
explicitly. To preserve gauge covariance and dimensional analysis, the scale transfor- 


mation law of the covariant derivative must take the form 
Vi = Vat k(V°S) Mas 


where the @ scaling of e,’” was defined above, and the linearity of dw in @ follows 
from the homogeneity of V in e. (Alternatively, we could put in something more 
arbitrary, but it would be eliminated by the rest of the procedure anyway.) From the 
variation of commutation relations we then find 
5 Mac = [Vin Vil 
= &[Va, Vol + (1 — k)®(VieP) Voy + kO(V VP) Majo +k? (VP)? Mas 
> k=1, Ry =PRy™ + EVV — 56,55 (VO)? 

If we make the redefinition (at least for & positive) 


p= ae) 


then we find the very simple scaling law for the integrand of the Einstein-Hilbert 
action: 


(e'R)' = e7'(¢°R — 45-5609) 


Exercise IXA7.3 
Consider a scale factor that is invariant under a Killing vector (see subection 
IXA2). 


a Show the Killing vector survives the scale transformation; i.e., 
Vi = Vat (V'S)Ma, [K,Va] =[K,6]=0 => [K,Vi] =0 


directly using commutators (rather than the Killing equations). 


b Although the operator K is the same, the Killing vector is different: 
K = K°V,+ 4K® My = K"Vi, + 1K! Mia 


Find K’* and K’ in terms of K* and kK”. 
Exercise IXA7.4 
Covariant derivatives for flat space in spherical coordinates can be obtained 


from those of Cartesian coordinates by a combination of coordinate and lo- 


cal Lorentz (rotation) transformations. However, there are simpler methods, 
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using a combination of transformations of a single coordinate and Wey] scale 


transformations: 


a Take the direct product of a sphere with metric d{2? (in arbitrary coordinates) 
and a line as 
ds? = d(In r)? + dQ? 
Then derive flat space in spherical coordinates by making a scale transforma- 
tion 
6=1 
to yield the metric 
ds* = dr? + r7dQ? 


Show that the resulting covariant derivatives are 


Vy = O,, Vi = Ly, is M,i) 


O° 


where V; are the covariant derivatives on the sphere corresponding to the 
metric dQ2?. 

b Find V in terms of (r,6,@) using the result of exercise [IXA5.3. Find V in 
terms of r and conformally flat coordinates x‘ for the sphere by first deriving 
from flat 2D space by another Weyl scaling. 


Many special cases of covariant derivatives can be derived completely by Weyl 
scalings. This includes the most commonly used ones, for cosmology and for static 
spherical sources. The general procedure uses the following facts in the following 


order: 


(1) In a space of one dimension, we can choose 
D=1 > V=0 


(There is no curvature in D = 1.) 


(2) For a direct product space, i.e., where the metric ds? can be written as the sum 
of the metrics of two (or more) spaces, the problem for solving for the covariant 
derivatives is separable. We can divide up the components into the covariant 
derivative for one space and that of the other, each using only its own coordinates 


and flat indices (and thus Lorentz generators): 


ds? = ds? + ds3 = V= (Vi, V2) 


A. ACTIONS 615 


and similarly for the curvature. 


(3) Under a coordinate transformation, each component of the covariant derivative 
(and of the curvature) transforms as a scalar. We need only apply the redefinitions 


of the coordinates, including those that appear in the partial derivatives: 
Va(z) > Val2’) 
(This actually applies the alternative < definition of coordinate transformation of 


subsection IC2.) 


(4) Under a Wey] scale transformation, 
ds*=@ "ds > Vi =6V,+(V°S)Ma, 
Rip = PP’ Rast + BEV VAS — 56,65 (VO)? 
These steps can then be repeated as necessary. (The first two steps alone lead to 


Cartesian coordinates for flat space.) 


Exercise IXA7.5 
Use this method (as opposed to that of exercise [XA5.3) to derive the covariant 


derivatives for the sphere in the usual spherical coordinates: 


a Use steps (1) and (2) to find V for the flat space with metric 


ds* ade de 


b For step (3), apply the transformation 
dé 


= = 9 
ama e (u = In tan 5) 


c For step (4), use 
1 


sin 6 


to get the usual metric and covariant derivatives for the (2-)sphere. We also 
note that exercise IXA7.4a is just a repetition of these steps, for a new 1D 


coordinate v which is redefined as v = In r, with a new = 1/r. 


Consider a field theory without gravity that has a conformally invariant action. 
Spontaneous breakdown of scale invariance produces a Goldstone boson for that sym- 
metry, the “dilaton” (see subsection IVA7). Any theory can be made globally confor- 
mally invariant trivially by performing a local scale transformation and making the 


parameter the dilaton field. 
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The dilaton can also act as a Higgs field: If we couple the dilaton to conformal 
gravity (gravity with local Weyl scale invariance), the Higgs effect reduces conformal 
gravity to ordinary (Einstein) gravity. For example, if we introduce the dilaton into 
pure gravity by the local scale transformation above (in analogy to the Stiickelberg 


model), 


Up to an (important) overall negative factor, this is the action for a conformal scalar. 
The dilaton field ¢ is a compensator for local scale transformations, and acts as a Higgs 
field for this gauge symmetry: By gauging it to its vacuum value (¢) = 4, we regain 
the usual form of the gravity action. (Alternatively, we can set (¢) = 1, and introduce 
« through the proportionality constant in (e,’) ~ 6q'".) In this formalism, where 
we require the action to be locally scale invariant, the terms which were conformally 
invariant before coupling to gravity are easy to recognize: They’re just the ones which 
have no @-dependence. (This may require some field redefinition: typically rescaling 
the matter fields according to their weight as above.) The cosmological term becomes 
Seos = fe tA¢??/-), which is a conformal self-interaction term for a scalar. 
Because what was the vierbein now appears only in the combination e,” — 


g ?/P—Ye,™, there is now the local scale invariance 


m 


és @e,™, od <5 p-2)/2 4 


since this transformation leaves the combination invariant. Gauge invariance of the 


matter action is then (using the infinitesimal parameter ® = 1 +4 ¢): 


0S ee ae 
0=6dSy ~ e,™ D-2 4.0" 
mM~ve Jem t a @ 56 
_2 ,0SM 
TT — _D-2 
a = 56 


Thus, conformal matter has vanishing T“,, since it decouples from ¢. (Actually, we 
also need to scale the matter as above to achieve this decoupling, and there is a 
corresponding 6Sy,/dwW term in the above derivation, so the trace may vanish only 
after applying the matter field equations, as in the derivation of VaT® = 0 from 
coordinate invariance in the previous subsection.) In particular, this is easy to check 


for the massless point particle, where T%, ~ X™X "9mn = 0. 


An interesting effect is obtained by eliminating the compensator by its field equa- 


tion. (We’ll consider just the classical theory here: In the quantum case, integrating 
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out this field produces an additional 1-loop contribution to the effective action.) Be- 
cause this manipulation involves integration by parts, we first expand the compensator 


about its vacuum (asymptotic) value: 
@=1t+5x > L=F[x(g50-7R)x-Rx-R 


Then eliminating y by its field equation, 


net (eee n) 
D—2 
This action still describes Einstein gravity, but is locally scale invariant (though 
not globally, because of the extraction of the vacuum value, and the way boundary 
terms were neglected). Of course, it is nonlocal, and the nonlocality becomes more 
complicated if nonconformal matter is included. Such terms also appear quantum 
mechanically: In two dimensions, dimensionally regularizing D=2+2e, in a Weyl scale 
invariant theory we can get a divergent, yet still Weyl scale invariant, contribution to 


the effective action proportional to 
1 1 1 1 
~ | R—_,.—— R-R]| = --R-3R=R 
€ R- 4550) E O 
After renormalizing the divergent term, which is topological and thus locally scale 
invariant in exactly D=2, but not in D=2+2e, the remaining finite term contributes 
a conformal anomaly (see subsections VIIIA7 and C1). 
Exercise IXA7.6 


The statement that the R term is topological in D=2 neglects boundaries. In 


general the topological invariant (the “Euler number” ) is (the “Gauss-Bonnet 


2 ae 1 €qpt* Dt? 
— —— = ae R ——— 
x / De 2 = 2n Hept tt? 


where t* is a tangent vector to the boundary X(T), as for the worldline of 


theorem” ) 


the particle, and D is the covariant differential (as for the particle equation 


of motion and the radial gauge; see subsections IXB2 and 4 below): 
Peas", Deedee Ve =e ob Vratrea=at 


(We have used the usual counterclockwise contour, and our convention €9, = 
1, or €zy = 1 in Euclidean space.) The additional term in y is the angle 
subtended by the boundary with respect to the surface (/27), as obtained 


from the cross product of t and t + Dt. We have written it in a form that 
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is manifestly invariant under the reparametrization of 7, and the v’s cancel. 


(Of course, it is also manifestly coordinate invariant.) 


a Prove that it is also scale invariant by showing that the connection part of 


the D exactly cancels the contribution of R to the boundaries, leaving 


We 4 
v= ([ Be) 


where we have turned the R term into a boundary term, and its remaining 


let 
27 Ht??? 


patch boundaries 


contribution is from the fake boundaries at the borders of patches (or sur- 
rounding singularities; R = Ow because the 2D Lorentz group is Abelian: see 
exercise [IXA5.3). 


b Note that the dt term doesn’t contribute if we choose a gauge where 
res 


(i.e., ’” = e,"). Demonstrate this by evaluating x in polar coordinates for a 
disk, and in spherical coordinates for the half-sphere. Show the result is half 
that for a whole sphere (exercise [XA5.3). Repeat the calculation for the disk 


in Cartesian coordinates (so then only the dt term contributes). 
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sectelieetmiorcnnra ates arias ic ae C7. 8) C2 |}. See eeenrn re neononveney 


We now consider various gauge choices for coordinate, Lorentz, and scale trans- 


formations. 


1. Lorenz 


We begin with gauges that preserve global Lorentz invariance, which are useful 
for perturbation theory. Therefore, we look first at perturbation by finding the kinetic 
term, which is sufficent for finding linear gauge conditions. (It can also be derived from 
general principles, as will be shown in subsection XIJA5.) We expand the vierbein 
about its flat value, 


é,"” = bq” + ha” 


At the linearized level, local Lorentz invariance implies that only the symmetric part 
of the field, sab); appears in the curvature and the action; we will denote this by hay 
to simplify notation. (In other words, the linearized curvature is invariant under the 
linearized local Lorentz transformations, which gauge away the antisymmetric part 
of the field. This is equivalent to working directly with the metric.) We then can find 
the linearized curvature, e.g., from the results of subsection IXA5 for the variation of 
the curvature, by considering variation about flat space: i.e., replacing Cg, — ha, and 
Va — Og. The result is 
Ra? © O10 hy 


=> Ra- snark = hap + 02h ~. — Ja ho)c = le = 0°" Rca) 


Since this comes from varying the action, the quadratic part of the gauge-invariant 


action must be 
oe / Lh Dap + 2(BPhas)? — h@aCh”s + 2h" ,O°O hrs] 


This part of the action, and the linearized curvature, are invariant under the linearized 


gauge transformations dha, = —O(arr)- 


Exercise IXB1.1 
Take the Newtonian (weak-field, nonrelativistic) limit of gravity: (1) Linearize 
the action by perturbing about flat space (e,” = 6'"+h,™). Keep just the part 
of the pure gravity action quadratic in the perturbation, the part of the matter 
coupling linear in it, and the complete flat-space matter action. (2) Assume 


small velocities. Now consider the problem of a massive point particle in the 
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field of a much more massive point particle (or spherical body); in the above 


approximations: 


a Show the effect of the gravitational field generated by the heavier particle on 
the lighter particle is given by the action for the lighter particle (in the gauge 
S=t=r) 

S=ms & je (m — 4m4? + mhoo) 


b Show this field is given by solving Laplace’s equation 


Roo & Ahoo = Too 


c Show that, with our conventions for normalizing functional differentiation, a 
point mass M in D=4 generates 
Mn 
too = M (2r)?6°(2) => hoo = —-— 
re 
using the usual solution to Laplace’s equation for a point source. Combining 
these results, we see that the potential energy for the particle is 


M 
U = mhop = — = 


which agrees with Newtonian gravity if we identify G = a. (If we restore 


units, this becomes G = kz.) 
The BRST transformations (see subsection VIA4) for gravity again follow from 
the gauge transformations: 
Ge," = C06," —6,"0,C" + Ce," 
OC” = C"0,C"™.. QCa=C"O.0C ese +O, Cs 
QC™ =-iB™, QCw = -iBw 
(Other forms follow from different parametrizations of the gauge transformations, 


and are equivalent to field redefinitions. For theories without spinors, we can work in 


terms of the metric, and avoid Lorentz gauge fixing.) 


Lorenz gauges for coordinate invariance are similar to Yang-Mills. For gravity, 
the gauge-fixing function is 
ta = O hap _ 50ahy 
The BRST procedure works similarly to Yang-Mills. Looking at just the graviton 
kinetic term, the gauge-fixed quadratic Lagrangian for gravity is then, in the Fermi- 


Feynman gauge, 


Lg — Lerr=let (Chas — 50nh,)" = —1A" Oh + th* 0h”, 


=P 7d 
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plus ghost terms. Note that the trace part of h appears with opposite sign to the 
traceless part. This prevents any redefinition which would allow rewriting the La- 
grangian in the simple form —$h® Oho. However, all derivatives have been absorbed 


into (’s, which makes the linearized field equation a simple Klein-Gordon equation. 


There are various generalizations of this gauge condition to include nonlinear 
terms, such as the “de Donder (harmonic) gauge”, which uses the gauge-fixing func- 
tion 

f= 5Om(/—g9""”) 
For example, this allows the field equation for a scalar to be written with only terms 


with both partial derivatives acting on the scalar. 


2. Geodesics 


Consider the field equations for coupling gravity and electromagnetism to a scalar 
particle: From subsection IIIB3, the action for a particle in external fields, rewritten 


in Hamiltonian form, is 
Sy = [ar(-arent(o)lne — A,(xz)]+ vf}, H = 37’ + d(2) 


where we have pulled the v out of H for convenience, and use the “covariant momen- 


or) 


tum 
Tq = Ca Dim or A,(2) = €, (Om + Am) 


in place of p,, (the canonical conjugate to x™) for covariance. All the equations of 


motion except the Lagrange-multiplier constraint 
57° +¢=0 
follow from the usual Poisson-bracket relation 
vO = iH, O 


which can be evaluated by using the canonical commutation relations (following from 


the simpler ones for p,,) 
ee t|%a; | = Cab Tat Paps oo =O 


Thus, 7, acts effectively like —ie, + A,, which is the covariant derivative for gravity 


and electromagnetism, less the Lorentz term. The # equation is the obvious 


un = £""e,,% 
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that follows from varying Sy with respect to m,, while the equation of motion for 7 
is 


—1l- b b 
Ua = —Ca “Toe — Fat — Vad 


Using the relation 
0 = Tobe = Cabe + Wable =  Ca(bc) alr Wibe)a = 0 


we find 

0 Pag rui.! 4+ roe + V°¢d =0 
This is the coordinate-covariant form of the Lorentz force law (plus scalar field). 
With only the gravitational effects we have the covariantization of the free particle 
equation, 


Dp* = p* — up’p°uy.7 = 0 


where “D” is understood as a covariantized 7 derivative (along a worldline with metric 
Vv). 

It’s useful to consider a continuum of particles (“dust”) moving under the influ- 
ence of these fields, such that any two infinitesimally close particles have infinitesi- 
mally different velocities, and only one particle passes through any particular point 
in spacetime (at least within some small region of spacetime). We then can treat 7, 
(or Pm) as a field defined for all x: Choosing a point x also chooses a curve X(T) for 
which « = X(7) for some 7, so we can write (x) in place of m(7). Specifying the 
field 7 also determines this family of curves, since the tangent to any curve is given 
by the X equation of motion 

X™ = une,™ 
(To determine the 7 parametrization, we also specify v, and the hypersurface given by 
the collection of points X (0) from each curve.) Then we can express the 7 derivative 
in terms of x derivatives: q 


— =X"), = v0e,q 


dt 
which gives the manifestly covariant form of the equation of motion 
1 Vota + Fart + Vad = 0 
For vanishing F’ (and thus A) and constant ¢ (= 3m7), this equation 
pV spa = 0 
describes “geodesics”, which are curves of extremal length, since the action is 


So = 0s; =ds” =de™dr"G... 
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for massive particles. These are the analogs of straight lines in flat space. (For 
positive-definite metric, they are shortest lines. Because of the indefinite signature of 
the Minkowski metric, the worldlines of massive particles are actually longest lines, 


while massless particles travel along lines with no length.) 


Exercise IXB2.1 
Uniform circular motion in 2D flat Euclidean space, constant d0/dt in po- 
lar coordinates, is associated with acceleration of constant magnitude. (Or, 
without time, we can say that a circle has constant “extrinsic” curvature with 
respect to the 2D space.) Show that an analogous situation in 2D Minkowski 


space can be obtained by Wick rotation: 


a Starting with the metric for 2D flat Euclidean space in polar coordinates, 
Wick rotate @ to make it a time coordinate (“Rindler coordinates”). Show 
by a transformation to “Cartesian” coordinates that this describes 2D flat 


Minkowski space. 


b Show that any curve described by constant r describes acceleration of constant 
magnitude, by evaluating (d?x(0)/ds”)* in “Cartesian” coordinates. Note 
that the direction of this 2-vector is fixed to be orthogonal to dx/ds (since 
(dx/ds)? = 1 by definition), so this is just the acceleration as measured in the 


rest frame. 


c Define the acceleration in arbitrary curved coordinates (in terms of p- Vp) and 
evaluate it in Wick-rotated polar coordinates, to obtain the same result as in 


Cartesian coordinates. (Use the covariant derivative of exercise IXA7.4a.) 


Exercise IXB2.2 
Equations of motion for particles can be derived from conservation laws. We 
know this already nonrelativistically, for a particle in a potential using energy 
conservation. Now consider a dust with T”” = px™z” and current J™ = pr™. 
(Compare subsection IIIB4. We could use those single-particle expressions 
here, but using dust instead avoids integration. Note that using z or p allows 
us to describe also massless particles. The existence of a conserved current 


corresponds to a complex field with a global U(1) symmetry.) 


a In the case with no external fields except gravity, show that the geodesic 
equation follows from covariant conservation of both of these quantities. (Of 


course, in flat space this gives the usual free particle result.) 


b Generalize to the case of external fields by adding to T”” that of the external 


fields themselves. When taking the divergence of those terms, use appropriate 
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source terms to the field equations in terms of the particle variables p and 
m. (In the case of a single nonrelativistic particle in a static electric field, 
this is the usual derivation of the force on a particle from the electric field’s 


pressure. ) 


For some purposes we need a weaker (but equivalent) form of the geodesic equa- 


tion: If for some scalar f and vector n° 
we VierHfer S eS un", (p- V)p* = 0, f=-(n-V)inu 


for some scalar u, which we can determine by integrating f. In particular, we can 
identify 


U=v > nm™=xX™ 


Thus, the more general geodesic equation allows arbitrary parametrization of the 
geodesics, while the stricter version (f = 0) corresponds to affine parametrization 
(v = 1) if we still want to identify p with X. (Remember, as with all constrained 
systems, the equations of motion p- Vp = 0 imply (d/dr)p? = 0, so any geodesic 
satisfying the stricter equation will have some fixed mass along that particular curve.) 
Exercise IXB2.3 
Show that in D=2 (one space dimension, one time) any lightlike curve is a 
geodesic, using the weaker form of the geodesic equation. (Find f.) This is 
a consequence of the fact that it is impossible to change direction in D=2 


without slowing down. 


The particle (geodesic) version of the conservation of momentum in the direction 


of a Killing vector is 


d 
oeVp.=0 => pVKep=0 => ee aa 


where covariant conservation p- V has become ordinary conservation d/dr (no con- 
nection term) because K - p is a scalar. (See also exercise IXA2.4.) This is the same 


as for the conserved current J? = K,T™ (subsection IX A6). 


3. Axial 


The definition of axial gauges in terms of the covariant derivative is the same as 


for Yang-Mills (n- V = n-0Q). In terms of the explicit gravity fields, 


mVen-0 => n™anre,” =n"o"" nw," = 0 
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In the case of gravity, this implies that lines in the n® direction are geodesics (see 
previous subsection): 


Ln =0 => (n-Vin=(a-e)n=0 


To analyze the consequences of axial gauge conditions for the metric, we need a 
further identity: For any vector field n*, consider the action of n- V on Nm = €m*Na, 
treating it as a scalar; in this calculation we ignore any indirect action of V on curved 


indices. Then 
(Oe te = (n= Vey ta = en’ (Rh Vina t+ Nel Vem” 
The last term simplifies for vanishing torsion, since: 


W Vile HU Veal Th Van == ae PE Vika = Vel bn Vin’ 


=V ph = 6, On 
We thus have 
(n+ O)em?Na = €m'(n+ V)na + Om(gnz) = (€n*Na)Omn” 
Applying this identity to the axial gauge condition, we find 
n-V=n-0, w= 0 = (nee ng=0 => te =] ea te = 00a 


by choosing the appropriate constants of integration. (This amounts to fixing a 


residual gauge invariance.) In fact, we can weaken the assumptions in this derivation: 
(ne V0, Te, =n 0, =| nee, i =0 => te =]e he =O 
We can now determine the form of the gauge condition on the metric: 
ie as ar GH) = hy H=w ha =P a 
Applying these results to perturbation theory, as 
eq” = 6™ + hom 


we then have 
we =n, => nyh’* =0 
tn Ne =O, te = a" 00 Ns =n, => mh”®=0 


and thus nh = 0, so we can again work with just the symmetrized h. 
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The lightcone gauge is again useful for eliminating unphysical degrees of freedom. 


The lightcone gauge conditions are 


For the rest of this discussion we work with just the symmetrized h. Separating out 
the trace part as hj; = hi, + dijh, where hi is traceless, we find for the linearized 


gauge-fixed action 


UP hg Oe 
= —th™ Oh _ 5(hi*)? a P= hh! 


hi = fi=—Oth* + On — 2 4o'n 

hh Sat f+ 2Oh =-dPh— + ath — Satoh + 2h 
1 

D=2}, _ ae 

a ees 


2 


where we have simplified some algebra by writing the gauge-invariant action as the 
Lorenz gauge one minus its gauge-fixing term $f?. (There is some ambiguity in that 
we can shift h’~’ by a 0¢h term, and absorb the generated terms into h’~~.) We see 
that all but hy are auxiliary fields (we redefined h~* and h~ by just shifting and 
applying 0*), and can be eliminated (but watch out if there are matter couplings, 
when eliminating them gives Coulomb-like interactions). (Again, this procedure is 
much simpler than quantizing in the de Donder gauge and then applying a further 
analysis to extract the physical polarizations, as is always done in other texts when 


analyzing radiation in general relativity.) 


The temporal gauge (known also as “Gaussian normal coordinates” ) is used when 


treating time and space separately: In this case we have for the metric 
= do > Jom = Tom 


An alternate way of defining the temporal gauge is to start with a spatial hypersur- 
face, and determine the geodesics normal to this hypersurface (go; = 0), where the 
positions on this hypersurface define x’, constant along the geodesics, and the proper 
times along the geodesics define x° (ggg = —1), with 2° = 0 at the hypersurface. The 
fact that these are geodesics guarantees that the hypersurfaces of fixed, but nonvan- 
ishing, (proper) time are still orthogonal to the geodesics (go; stays zero): Consider 
some constant V™, representing the separation dx™ of 2 “fixed” nearby points in any 


hypersurface, (n- V) = 0. Then the statement n- V(n-V) = 0 that the separation 
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of those 2 points remains in the hypersurface is just the equation derived above, i.e., 
(12> O)em te = 0. 

Equivalently, we can consider a dust of massive particles and choose an initial 
hypersurface orthogonal to their (timelike) geodesics to define x° = s = 0. This 
coordinate system is thus the “rest frame” of the dust; all the information about the 
geometry of the space is contained in the time dependence of the spatial separation 
of the particles (g;;). There is still the residual coordinate ambiguity of how to assign 


x’ on the initial hypersurface. 


Gaussian normal coordinates thus can be useful for studying the dynamics of 
particles: For example, we can study a gravitational field of distant, unknown (or 
ignored) origin (i.e., the curvature of spacetime) by watching the relative motion of 
two nearby particles of such a dust, neglecting the gravitational force/curvature effect 
acting between the two particles themselves. If the two particles start out relatively 
at rest at some initial time (which is well-defined only if they are close and relatively 
slow), then in the temporal gauge the paths of both particles are described by fixed 
x’, independent of x°, since their geodesics are simply lines in the time (n* = 6¢) 
direction, and the proper times of both particles are the same as the time x°. Then 
the distance between the particles at any given time is given by the magnitude of 
dx™e»“, with dx® = 0 and dz’ their infinitesimal separation. Thus, since the 2’’s, 
and thus dz’, are fixed, we want to study the change in e,,“ (really just e;%; eo% = 60) 


with time. Using our evaluation of (n- V)em* from above, we find 
(n- V)°em* = (n- V)Vimnn® = [n- V, Vinjn® = 2"[Vn, Vinin® = —n’n’Roac*em* 


using the axial gauge condition n- V = n-0. For the Gaussian case n* = 6§, we then 
have 


ee a a b 
Cm =—How Gn 


(Of course, vanishing curvature implies geodesics that start parallel remain that way, 
because the space is then flat.) By observing different sets of particles initially at 
rest with respect to each other, we can choose different timelike directions n, and 


determine all the curvature components from their linear combinations. 


Exercise IXB3.1 
Let’s examine some 2D examples of axial gauges in spaces with positive- 
definite metric: 
a Gaussian normal coordinates need not be Cartesian in flat space. Show that 


polar coordinates for the plane define an axial gauge. What is the coordinate 


in the “n®” direction? Give the geodesic interpretation. 
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b Repeat the above for a curved space — the (2D) sphere in spherical coordi- 


nates. 


c Apply the above “equation of motion” (€ = —Re) to the sphere. (See exercise 


IXA5.3.) Show its solution agrees with the obvious. 


4. Radial 


Another useful gauge similar to the axial gauge is the radial gauge (“Riemann 
normal coordinates”), discussed for Yang-Mills in subsection VIB1. In this case we 
have 


LT =e S- Ge Vir a=n One 6, =a 


a case of the more general form of the geodesic equation. Applying the same identity 


as for the axial, we again have 
n-V=n-0, Ont =O => ie Den te=— Oba te = Gh tig =O, 


but now the boundary condition is already implied by the gauge condition near the 


origin: For any infinitesimal 7” = e”, 


een 0) =e". = 2° (0)=o 


oo =O. => ww") S06 


Thus, there is no residual gauge invariance, unlike axial gauges (where the coordi- 
nates of the initial hypersurface need additional determination). Any reference frame 
satisfying these conditions at the origin is called a “local inertial frame”, and is the 
most natural for an observer at that point in spacetime. (In flat space, this yields 


Cartesian coordinates. ) 


Exercise IXB4.1 
We can think of Gaussian normal coordinates as defined by a dust of particles 
with affine parametrization v = 1 and unit mass m = 1, with rT = s = x and 
x’ constant for any particle (x =p =n). For Riemann normal coordinates 
we can think of particles radiating out from the origin x” = 0 in all possible 
directions in space and time, but then some must be antiparticles (traveling 
backward in time), some must be massless (for the lightlike geodesics), and 
some must be tachyons, with m? < 0 (for the spacelike geodesics). However, 


as for the Gaussian case, we can still identify 


Wn” = X™ 
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Using the radial gauge condition, show that these can be chosen as geodesics 
with 
v=<, Air) =e X(0}, p= X(0) 


so all particles start at the origin at 7 = —oo, and their position at rT = 0 
is determined by their initial (constant) momentum. (Thus particles with 
proportional momenta travel the same path, but arrive at different points at 


0 


T = 0; however, in this case 7 is neither the time x” nor the proper time s, 


but just an arbitrary parameter.) 


As we saw in subsection VIB1, the radial gauge is related to gauge-covariant 


translation (in general relativity, “parallel transport”) as, for any tensor ~, 
bg) =e OP aw(y) =eAb(g), = eM Ea) Pmym 


where y is the “origin”, A = A'M, is just a Lorentz transformation, and D is the 


covariant derivative acting at y: 
D, = te (Ds =F Wai (y) My, Dy = aaa Ds D;| = lan Da = Rav M, 


As in general for coordination transformation parameters A°, x* now transforms under 
local Lorentz transformations. (In background field language, this “quantum field” 
transforms under the “background” Lorentz transformations.) Thus, x* is now a 


function of y; it cannot be made even covariantly constant in general: 


Dy=0 => 0=([D,,Di\a* =—2* Rea 


(For more practical reasons, if we defined it to be invariant or constant, the manip- 
ulations that follow would break down.) At this point we have only made a Lorentz 
transformation on w, since it and y) are evaluated at the same point 7. However, as 
for Yang-Mills in subsection VIB1, for the next step we want to identify x* as the 
new coordinate: 
wb (a) = 0 G™(y™, 2)) =e Poy) 

where y’ has implicit dependence on y, since in radial gauges the choice of origin 
is gauge parameters that define the gauge. (The coordinates are defined as radial 
with respect to the origin y.) Thus, we have made a Lorentz transformation ~ — w 


followed by a coordinate transformation q) > wv". 


We also want to define a covariant derivative for x by 


Vi" = (Dp =e Day) 
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where V (as for Yang-Mills) contains only 0, = 0/Ox* and not D,,: 
Va = €a’(£)Op + Wa! (x) My 


At this point we no longer distinguish flat and curved indices, since the Lorenz gauge 


has been fixed. We have then transformed 
y,¥,D—> 2,y',V 


Note that the y-coordinate tensors are the x-coordinate tensors evaluated at the 
origin: 
yO) =v), (Ve")(0) = (DY) y) 
We can identify this as the radial gauge when x(y) satisfies the geodesic condition, 
since then 
Dea) = gee"2 =e 


= g°V0 =e-(Deby =a Dey =e+Dy =z: 0 
making use of q)/(x) = e™ ?y(y). 
Unfortunately, it is somewhat difficult to continue this construction in terms of 


the covariant derivative, but simpler in terms of the “dual” differential forms. We 


therefore define the (Lorentz-covariantized) Lie derivative as 


A 


Loapp=2-Dy, LepD=([x-D,D] 


for any “tensor” (object carrying only flat indices) w and any “covariant derivative” 
(object with a flat vector index free, but multiplying partial derivatives and Lorentz 
generators) D. We generalize to evaluate on not only w and D, but to apply the 
Lie derivative also as part of the transformation exp(L,.p). For that reason, for the 


remainder of this section we will abbreviate L,.p as just £L. We then have 
[x -D, Da] = —(Dax”)Dy + 2°(Tha°De + Roa’ Mr) 


Defining Lie derivatives to satisfy the usual Leibniz and distributive rules like any 


derivative (since we use them as infinitesimal transformations), we then find 
(CEES) a eT. flat =a Rae 
In terms of differential forms, defined as 


ee = aye.” aw! = dy” Wm? 
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D=€D,=d+a'M, = Der =dr*—2°o,? 


we then have 
Le" =, LIT RR) =a DUT) 


LE®* = Da? — E°s°T,,? 
te =f R gg => LD) SPOS Rig" 


which covers all the quantities that appear in evaluating e* on €% and w!. The 


geodesic condition prevents higher derivatives of «* from appearing in the transfor- 
mation law, and allows us to freely reorder all the x’s to the left at the end of the 
calculation for identifying the coefficients of the Taylor expansion, at which point 
we can forget that x depends on y. Thus, these few equations for the action of L 
allow any transformed quantity to be evaluated straightforwardly by iteration, Taylor 


expanding e* in powers of L. 


The important distinction between the transformation laws for D, and €° is 
that for €* the derivatives of x appear only in the combination dx, which makes 
changing coordinates from y (or y) to x easier. Specifically, by iterating the above 


Lie derivatives, we find a solution of the form 
Cf al =f AP eB, gg =ea@ =f"A) + (Dr Bb; 


where A,*, B,”, Aq’, B,' are functions of x and of tensors evaluated at the “origin” 
((D---DT)(y), (D---DR)(y)). For Riemann normal coordinates, we want to fix y 
(e.g., y = 0), so we evaluate the above at dy = 0. Furthermore, we can choose the 
gauge w(0) = 0 (at least for vanishing torsion), so also Da — dx. Then the solution 
is 

E* = dx°B,*, i? San 


Thus, B,” and B,/ are the inverse vierbein e,,% and Lorentz connection w,,! for the 
new coordinate system, 


Va = (B")a°(O, + By’ Mz) 


written explicitly as a Taylor expansion in x by the above method, all of whose 
coefficients are tensors (torsions and curvatures and their derivatives) evaluated at 


the origin. 


However, we can also use these results for first-quantization (where actions are 
expressed in terms of, e.g., €’“/dr for the particle) in background field gauges, by 
choosing y as the background and z as the quantum coordinate (see subsection VIB1); 
then we keep both the dy and Dx terms. 
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Exercise [IX B4.2 


Find the first few orders of this expansion. 


a Using the above method, show that for vanishing torsion 
E® = dx?(5f — p2°t* Rea" + -.-), w! = Fdz*(x’ Ry! +...) 


b Check the validity of this result by evaluating [V,V] to this order from the 
V given by this F and w. 

c Use instead the covariant-derivative method of subsection VIB1. In this case, 
we find 


Va = orp @? dp he ae 


where h,” is chosen to cancel all D,, terms in V, and we have defined 


Da = Ex (y)Din + Wa! (y) Mr 


where now x° is “constant”, so D,, and M do not act on it. (Otherwise, in 
this approach, we would be stuck with tons of D---Dz terms.) In terms of 


the previously defined Lorentz generators, 


Map — Map i Lad] 
Find h to this order, and use it to obtain 
v= (6° = ett Rissa’ is ..-)Ob + (42° Ria i .-) Mae 


restoring M to M. 


5. Weyl scale 


The gauge-fixed kinetic term can be simplified by including the conformal com- 
pensator (see subsection [IXA7). The quadratic part of the gauge-invariant Lagrangian 
is then (6 = 1+ 5x) 

Io = —7e '@(R— 4550) ¢ 


2 
ee —3[h™ han +2(O"Aab)? — h* ah, + 2h7 gO hte] — 3x (Aa — 9° OD" haw) + 355 XOX 


The nicest (globally Lorentz) covariant gauge comes from choosing the coordinate 


and scale gauge-fixing functions 


de = O hap _ s0nh", =F 50aX; 7 =X Rew 
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We use these to obtain the gauge-fixed Lagrangian (see subsection VIB9) 


L = Lo t+ $(ONav — $0ah"s + 50x)” — $(x — h*a)O(x — he) 
= —th™Ohw + tpsxOx 


plus ghost terms. Now the hf kinetic term is simpler. Also, remember that y de- 
couples from conformal matter. These features of gauge fixing make this formalism 
closely analogous to the Stiickelberg formalism for the massive vector. We can also 
define nonlinear versions of these gauge-fixing functions, such as 0,,(¢@e7!/*e,") or 
Om(¢?,/—gg""") for the coordinate gauge, and ¢e~'/? or ¢?,/—g for the scale. 


Exercise IXB5.1 
Find the ghost terms for linearized gravity in the Fermi-Feynman gauge, and 


its simplification with the compensator. 


The scale gauge can also be fixed in terms of the vierbein/metric alone: For 
example, we can fix the gauge 


e=1 


in which case ¢ acts simply as a renaming of e. A more unusual gauge is 
=v 


This is not a restriction on the geometry, since the physical Ricci scalar is effectively 


replaced by its scale transform 
R' = gee) (R 3 421) 


which is scale invariant. In the gauge ¢ = 1, R’ = R, but in the gauge R = 0 it is 
proportional to Od. 
Exercise IXB5.2 
Show that the ghosts for scale transformations propagate in the gauge R = 0: 


Find their contribution to the action. 


More general gauges are possible when matter fields appear. For example, con- 
sider coupling gravity, with compensator, to a physical conformal scalar w~. With 
appropriate normalization of the compensator and physical scalar, the kinetic terms 
for the two fields are identical except for sign: There is a manifest O(1,1) symmetry. 
We can take advantage of this by using a “lightcone” basis for these fields: Defining 
g+ = +7, the full nonlinear (in gravity) Lagrangian L becomes (S = [ dx e~'L) 


L=¢,(240 -1R)d_ 


B. GAUGES 635 


The overall normalization is arbitrary, including sign, since we can rescale either field 
by a constant. Many Weyl scale gauges are possible, and somewhat more transparent 
than making field redefinitions on the corresponding action without compensator. 


Effectively, we can redefine the fields $4 arbitrarily as long as we don’t fix ¢,/¢_ to 


a constant, since that combination is scale invariant. (I.e., ¢,/@_- can be redefined, 
but not fixed.) 


Some of the more interesting choices are: 


gs=lty => L=-}R-9(S50- Ry 
+ ~~ e => L ~~ —4 _ 2 py 
Pt = a => L= y[( = a’) 20 ~~ iRly 


P+ =F, o- it > L —7<Ry 


We can also have any of these gauge-fixed Lagrangians with opposite overall sign, 
simply by changing the choice of either ¢, or @_ by a sign. The first two choices 
are useful because they put the action in standard form, as the usual gravity action 
plus a physical scalar kinetic term. (Thus, coupling a massless scalar to gravity either 
conformally or minimally is equivalent, and the two cases are distinguished only by 
interactions.) In fact, the first choice, or “temporal gauge” 61+ ¢_ = constant, just 
returns us to the form without compensator, @ = 1. On the other hand, changing the 
sign of @_ yields the “axial gauge” ¢, — @_ = constant, which is fixing the physical 
scalar as ww = 1. The overall sign of the action changes because the physical scalar 
is traded for the compensator, or the corresponding part of the metric. This gauge 
is closely related to the “string gauge”: In our third choice above the gravity action 
is invisible until the surviving scalar has been expanded about its vacuum value. 
The constant a is arbitrary except that it must not vanish (so that ¢,/@_ is not a 
constant). In particular, this action appears in string theory, with the choice 
“= —— => L=9(0-FR)p 

which eliminates explicit D-dependence. Again the scalar appears with the wrong- 
sign kinetic term, but R appears with the right sign (or vice versa), because of more 
complicated redefinitions. The sign of the O) changes back to the usual for |a| > 1. 
However, for ja] = 1, it disappears completely. A similar result occurs for the last 


choice, or “lightcone gauge” @_ = 1. 


Exercise IXB5.3 
The property that distinguishes this kinetic term for a scalar coupled to grav- 


ity is the O(1,1) symmetry: 
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a Before fixing the Weyl scale gauge, the continuous SO(1,1) subgroup of this 


symmetry is just the scaling d, — A*!dz. After gauge fixing, this trans- 


formation may change the gauge, and thus may need to be combined with 
a constant Weyl scale transformation to preserve the gauge. In that case 
the vierbein will also transform under the resulting modified SO(1,1) trans- 
formation. Find the SO(1,1) transformations for y and e,"” in the above 4 


gauges. 


b There is also the “parity” transformation of this O(1,1), 6, < @_. Find the 


modified form of this transformation for y and e,"”. 


Exercise IXB5.4 
Add to the above action a term proportional to (¢, — ¢_)?¢_4/P-?).. By 
considering various gauges, show this action is equivalent to (1) the action 
for gravity plus a scalar conformally coupled to it, with a renormalizable 


self-interaction, and (2) an R+ R? action with no scalar. 


Although all these choices are equivalent in perturbation theory (though the 
physical scalar may require a nonvanishing vacuum value), they aren’t necessarily 
so nonperturbatively, depending on the ranges of the various scalars. Unfortunately, 
nonperturbative gravity is not understood well enough (even classically) to make such 
distinctions, even though they may be important physically. The above considera- 
tions generalize straightforwardly to the case with many physical scalars, where we 
may consider symmetry groups such as O(n,1). If the physical scalars form a nonlin- 
ear o model, the compensator may join in to make the o-model groups noncompact: 


Examples of this appear in supergravity and strings (see below). 


The appearance of a physical scalar can also affect the way scale gauges are chosen 
in conjunction with coordinate gauges. For example, a result similar to the one found 
at the beginning of this subsection can be obtained from the (linearized) action with 


both compensator and physical scalar w (where (w) = 0), 
L& Lo — 7vOy 


choosing the same y-dependent coordinate-fixing term (f,)?, but imposing the scale 
gauge 

b= kv — hs) 
The result is identical to the one given at the beginning of this subsection, except 
that now no scale ghosts appear: The scalar that appears as h®, is now physical, and 
no longer needs a ghost to cancel it. This is the perturbative “string gauge” for scale 


invariance, which appears automatically in covariantly gauge-fixed string theory. 
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Exercise IXB5.5 


Let’s investigate such gauge choices further: 


a Starting with the Fermi-Feynman-gauge-fixed linearized gravity action of sub- 
section IXB1, add the physical-scalar kinetic term —<v0y. Separate the 
traceless and trace pieces of h,,. Show that the string-gauge action (i.e, the 
one given at the beginning of this subsection if we ignore ghosts) follows from 
simply switching 

boa, 
and then identifiying the new 7 with /2y/(D — 2). 


b The way the physical scalar of string theory appears in the gauge-invariant 
and gauge-fixed action is slightly more clever than as described above. (See 
subsections XIB5-6 below.) The kinetic term (already in the string gauge for 


scale invariance) is 


s= f dx o(0- 4R)6 


where the missing e has been absorbed into & by a field redefinition. (Since 
@ is thus not a scalar, we define O@ by e~'/*He!/?9, since e!/? is a scalar.) 


Expanding 6 = 1+ x, the linearized gauge fixing is now simply 
L—L+35(0hab + Oux)? 


(or we can use the nonlinear gauge-fixing function 0,,(®e,"")). Show the result 


is the same as above. 
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oMiuatgeeentaaaeGa C. CURVED SPACES ...................... 


There are some important solutions of general relativity that have no close analog 
in Yang-Mills. Here we consider the ones relevant to the only experimental verifica- 
tions of this theory: Solutions outside approximately spherical matter distributions 
(like the Sun and Earth), and those describing the Universe itself. 


1. Self-duality 


Plane wave solutions can be constructed for gravity in the same way as for Yang- 
Mills (see subsection IIIC3): A little more work (solving the torsion constraint, or 


using the result of the free theory) gives 
Vie? =7 CR" @ 6 =k" @ Me (v=o, Ve?) 


where R***9 is an arbitrary function of «~, but symmetric in i7, and the empty-space 


field equations imply it is also traceless: 
Rttt = 0 


If we want to couple Yang-Mills to gravity, then we can still write exact solutions as 


long as both waves are parallel; then 
Rt = ott = Lip( ptipty 
g 


where here g? refers to the Yang-Mills coupling. (Similarly, we can add in other fields, 


such as massless, neutral scalars or particles.) 


Exercise IXC1.1 
Check that the gravitational plane wave solution satisfies the field equations 
and torsion constraint. Show that we can also find more-special solutions of 


this form satisfying 
Gmn = Nmn + artr(AmAn); Fe enero = aztr (Finn! pq) 


This has the interpretation that the “graviton” is the bound-state of two 
“eluons”. However, it is only a kinematic effect, since the two gluons happen 
to be traveling in the same direction at the same speed. (We saw in subsection 
VIIB5 that a similar effect always occurs in D=2, since there only two spatial 


directions exist.) 
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Self-duality for Yang-Mills was discussed in subsection IJIC4. Similar remarks 
apply to gravity: We again impose 
[Va, Vol = +2€atea[V°, V4] 


Self-duality again implies the field equations, by dualizing the Bianchi identities: For 
gravity 


Rlaveja 0 > 0 apc Rieae — je ebcpg Ri a0 = —R*, 


(Note there is no extra minus from ¢? in even time dimensions.) While it might 
appear that the self-duality condition is still second-order because solving the torsion 
constraint makes the Lorentz connection the derivative of the vierbein, the self-duality 
allows the gauge where the connection is also self-dual, and this condition effectively 


becomes a first-order field equation: 


1 e 1 de 
Rated = Redad > Rabed = Sea tedar lia f => Wabe = TE F€bedeWa 


Exercise IXC1.2 
Apply exercises ITIC3.2 and IIC4.1 to gravity: 


a Rewrite all the above results of this subsection in spinor notation for D=4. 


b For arbitrary dimension D, generalize e_* to an arbitrary function of x~, x’, 
find the covariant derivative and curvature in terms of it, show the source-free 


Einstein’s equations imply it satisfies 
(0’)Pe_t —(0 


and in D=4 identify the pieces analytic and anti-analytic in x’ with the two 


polarizations. 


In four dimensions (2 space + 2 time), lightcone methods can again be applied 
(see subsection IIIC5): Now 


iv, V7] _ COTE ROO Mares (Wage? = 0) 


The fact that [V‘’, V°'] has only an Mg: term poses an additional constraint; 


the full solution is then 
VEX = 98%, VO" = AO" + (GP* IPF) O® g + 3(O° 0°" 0*" 6) Mary 


RVYS = 48% 926" 27 gP%" g 
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(In this case, the existence of covariantly constant spinors is a consequence of self- 


duality.) The equation of motion that follows from the final condition is now 


Dd — 10°" 0°? 6) (0° ad® grb) = 0 


2. De Sitter 


The simplest spaces are those where the Ricci scalar is constant, and the other 
parts of the curvature vanish: 
i= kof. 54 


These are special solutions of the field equations without matter, but with a cosmo- 
logical term, where there are no physical gravitons (the Weyl tensor vanishes), and 
thus represent the vacuum. Since there are no physical degrees of freedom, we can 
represent this space by just the conformal compensator: i.e. the vierbein (metric) 
is just the flat one up to a local Weyl] scale transformation. We thus have (from 
subsection IXA7) 


Ry! — B50, 096 — 56,54 (48)? = koe, 54 
[a] [a™d] [ab] 


where we have written the curvature as a scale transformation of flat space Ra»? = 
0, V = 0: The space is “conformally flat”. Separating this equation into its ir- 
reducible parts with respect to the Lorentz group, the Weyl tensor part vanishes 


identically, leaving 
260¢ — D(d6)? = Dk, Dd,%,P = na» Oe 


(The latter equation isn’t implied in D = 2, where the global conformal group is 
larger, and more general coordinate choices are possible for this solution. However, 
we can still use it consistently.) The latter equation can be solved easily: Looking at 
a # b, we see that @ is a sum of functions of one variable. Then looking at a = 0 tells 
us that these functions are quadratic and have the same quadratic coefficient, while 


the former equation gives k: 
@=A+ Br, + C5r"La, k = 2AC — B? 


We can choose any A, B*, and C’ that give the desired value of k: For example, we 
can choose the solution @ = 1 + ska? (giving the usual flat-space coordinates for 
k = 0), or 6 = Bx, (choosing the direction of B® as appropriate to k = —B? — 
spacelike, lightlike, or timelike). 
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Exercise IXC2.1 

Show that after a Weyl scale transformation the action for gravity including a 
cosmological term is (up to a sign) that of a conformal self-interacting scalar 
coupled to gravity. Use this to show that the de Sitter space solution (in the 
R=0 gauge) yields an “instanton” for this scalar theory, and compare with the 
Yang-Mills instanton of subsection HIC6. Show that similar solutions exist 
for massless scalars in arbitrary dimensions with potentials ~ ¢” for arbitrary 
n (but then k = 0). 


The geometry of this space can be understood most easily as that of a D- 
dimensional hyperboloid embedded in a flat (D+2)-dimensional space, where we add 
one space and one time dimension: Again using the methods of subsections IA6 and 


IVA2, we now supplement the constraint 
y=0 => y=eu%, (w 


with the additional constraint 


1 1 
A 
n“yz=1l => e — 
naw,  —n7 — nth, +n" Xa 


where n“ is a (D+2)-vector, yielding the intersection of a cone and plane. In partic- 
ular, for n? 4 0 we can write the metric on the space whose coordinates are all but 
n-y: 

ya?) = £4n7=0; —ds* = dz? 
which is the definition of a hyperboloid. Comparing the metric, we find the previous 


result: 
—ds”? = dy* = e*dx*dxq, e=@!' = k=-n? 


Of course, by appropriate choice of the original flat space, we can choose a space 
of any signature. In particular, we see that for a unit sphere k is normalized to 1. 


Thus, with our conventions we have in that case 
unit sphere: Ray! = dfn 


(but the constant value of the Ricci scalar will depend on the dimension). 


This gives the most general coordinate system for de Sitter space as a local scale 
of flat space, since conformal transformations are the most general coordinate trans- 
formations that will just replace this scale factor with another, and they just rotate 
n4. The symmetry group of the D-dimensional subspace that satisfies these two con- 
straints is as big as the Poincaré group, namely SO(D,1), ISO(D—1,1), or SO(D—1, 2), 
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depending on whether n?<, =, or >0: The former constraint preserves the conformal 


group, while the latter kills a timelike, lightlike, or spacelike coordinate. 


Exercise IXC2.2 
We can also start instead with a D+1-dimensional space, which is a natural 
choice for the symmetry group of de Sitter space: Consider the metric and 


constraint 
“hd adF ak ded ety, 1S =k PP ie Sey 


Both equations have the same global symmetry group, determined by the sign 


of k; k = 0, flat space, can be considered as a limiting case of the others. 


a Solve the constraint y? = 1 — z(x) as in subsection IVA2 for ¢? = m? > (x), 


and substitute to find the metric in terms of x. 


b Find the conformal transformation on x* that relates this coordinate sys- 
tem to the more general one above. (Hint: Use z of the D+2-dimensional 


construction. ) 


3. Cosmology 


As discussed in subsection [VA7, the universe is approximately isotropic (rotation- 
ally invariant) and homogeneous (spatially translationally invariant), so the metric 
should depend only on time. This means that the 3D subspace at any fixed time 
should be 3D spherical, flat, or de Sitter space, up to an overall time-dependent scale 
factor: 

—ds? = —dr* + ¢*(7r)r 


where TY is the de Sitter metric for the 3 other dimensions for k = 1,0,—1 (given, 
e.g., by the coordinates in the previous subsection.) By a simple redefinition of the 


time coordinate, this can be put in a form which is conformal to a static space: 
ds? = ¢*(t)ds®, ds? = —dt?2+7 


where by “#(t)” we really mean “¢(7(t))”, and the two time coordinates are related 


by 
1 
cade => f= | dr = 
(rT) 


Using previous results for 3D de Sitter space, we find ds? has curvature 


or a= je o(r(t)) 


R,;* = koi6 rest = 0 


l 
a]? 
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where k = 1,0,—1. The case k = 0 (in good agreement with observations, at least 
for anything but times shortly after the Big Bang) reduces to the dilaton cosmology 
of subsection IVA7; here we will generalize the results given there, with a different 


(and sometimes better) derivation, from general relativity. 


Again we begin by considering a universe filled with noninteracting dust, its rest 
frame defining the preferred time direction, its homogeneity and isotropy the source 
of those properties of spacetime. Working directly with the energy-momentum tensor 
(rather than deriving it from that of the particle, as in subsection IVA7), we then can 
write (compare exercises IIIB4.2 and IXB2.2) 


Te = py (tutu, u* = 64 
where py is just the spatial density of particles in the “rest” frame. One way to 
derive the @ dependence of py, that generalizes straightforwardly to other cases is by 
using conservation laws: By considering particles all of the same mass in units m = 1, 


or by considering J and T for each individual particle (since in this case we neglect 


interactions), we have from current conservation 
J*=pyu® > 0=V,J% = edme'J%e,™ = (67 *)00(¢*) (pmo *) 
> py =3ad° 
for some nonnegative constant 3a, and using (covariant) energy-momentum conser- 
vation as a check, 
u"Vaty = Volts = > Oouy =0 (geodesic) => Ver =4fV,J° + J°V a =0 


where we have used (from the result of subsection IXA7 for scaling covariant deriva- 
tives) 
Vo=0 +> Vo= od 
(Note, however, that V; has Lorentz pieces, and Mo;J; ~ Jo # 0 even though J; = 0.) 
Exercise IXC3.1 


Find completely explicit expressions for the covariant derivatives in this case 
(choosing some coordinates for Y for k = 0,+1) using just the Wey] transfor- 


mation method of subsection IXA7. 


For radiation, the momenta of the photons can’t be timelike (they’re lightlike, of 
course), but we can still use rotational and translational invariance, together with the 
fact that the trace of the energy-momentum tensor vanishes (from scale invariance: 
see subsection IXA7). Then 


Tat = prlt)d(4utu’ +”) 
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There is no conserved current, but energy-momentum conservation alone determines 


0=V~TH = toumu WV ar a V'tpp =o ‘E5(g0m Oo — 50opr) 


“Ae A/S 7 
=o bp Oopy; PR ==> pr = be“ 
for some nonnegative constant 3b. 


Writing the vierbein as a scale-transformation of the constant curvature space 
discussed above (de Sitter in spatial directions, flat in other directions), the gravita- 
tional field equations with matter and radiation become (using results from subsection 
IXA6 or 7): 


6[(Vad)(Vod) — 4nas(V)?] + [(maol] — VaVe) + (Ras — 4nasR)]¢? = 2(Turas + Trav) 


The only independent components of this equation are the 00-component and trace, 


which are, after multiplying by an appropriate power of @: 
20 + ske =ad+35b, o+ko=a 


For k = 1, these are just energy conservation and the equation of motion for a 
harmonic oscillator (centered at ¢ = a). The 00 equation gave energy conservation 
because Too is the energy density. The trace equation gave the field equation for ¢, 
which is proportional to the time derivative of the 00 equation, due to the relation 
of T*, to 6S/d¢@ given earlier. These equations are easily solved: Imposing the initial 
condition (0) = 0 (i.e., we set the “Big Bang”, when curvatures and energy density 
were infinite, to be t = 0) and 4(0) > 0 (so ¢ > 0), 


l—cost sin t 
k=40 ¢¢ gea< iP +Vb4 t 
—1 cosh t—1 sinh t 


The “physical” time coordinate is then T = iF dt @. In general ¢ can’t be expressed 
directly in terms of T, so we use the expressions for both in terms of t. For example, 
for k = 1 and b = 0 (just matter), we get a cycloid, which has only such a parametric 
expression. Explicit expressions can be found for a = 0 (just radiation): (7) is then 


a circle, parabola, or hyperbola for k=1, 0, —1. Also, for k = 0 and b= 0, ¢~ 77/8 


(vs. \/7 for a = 0). 
Exercise IXC3.2 


Find the modification to the equations of motion when a cosmological term 


is included. 
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Returning to the case of pure matter (b = 0) as in subsection [VA7, we now find 


the energy conservation equation for general k 


Again comparing to the Newtonian equation for a particle, we see that —$k cor- 
responds to the total energy, determining whether expansion is eternal or leads to 


collapse. 


4. Red shift 


We now generalize the results of subsection IVA7 on cosmological red shift by 
considering Killing vectors. Since the cosmological solutions are related to static, 
isotropic, homogeneous spaces by a time dependent (but space independent) scale 
transformation, the symmetries of this space are just in the spatial directions, and 
are basically the same as before the scale transformation. Specifically, the Killing 
vectors that survive the scale transformation Vz = ®Vq + (V°@) Map satisfy (see 
excercise IXA7.3) 


K-V@=0 = K=K = K.=o"'RK, 


We then find for conserved momenta K“p, ~ @~'p,. Since the K’s which survive are 
just the spatial ones, we at first find only the spatial components of ®-'p, conserved, 
but the conservation of the time component follows from p%p, = 0 for photons. Thus, 
Da ~ @~ o-'. Since p, is what an observer measures as the components of momentum 
(in his “local inertial frame”, a gauge where at his location the metric is flat and 
its first derivative vanishes), observers measure the photon’s energy as having time 


dependence ~ ¢71. 


Exercise IXC4.1 
Using this result for the @ dependence of the momenta of individual particles, 
we can now rederive the @ dependence of p’s of the previous subsection directly 


from the explicit expressions for J and T of the point particle. 


a Rederive J and T in curved space as in subsection IIITB4 and show that 
J”, = e(p°)e(21)*6° (a — X), TM = JM" = J%Hp™ 


b From Killing vectors we just saw that 


a0 fo (m#0) 
a re (m= 0) 
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where the massive particles are at rest (p* = mdj). Combine these results to 


find 
o> (m#0) 
4 (m=0) 


c Find the factors multiplying the p’s in T@ for the two cases of dust and 


Pew os, po rmaney ~ | 


radiation from the explicit expression for 7”. For the massive case (dust) 
all particles can be taken at rest, but for the massless case the particles 
travel at the speeed of light, so average over particles traveling in the three 
spatial directions and their opposites. (p is a continuous function obtained by 
summing the 6 functions of all the particles. However, for the above results 
it is sufficient to consider each individual particle for the massive case, and 6 
particles at the same point going in + orthogonal directions for the massless 


case. ) 


As discussed in subsection IVA7, astronomers use the parameters H, g, and 2 
to measure general features of cosmology. Here we can generalize the analysis to 
k £0, which we have solved above. In the case of pure matter, and with vanishing 
cosmological constant, (2 = 2g. Then the “critical” value is q = 5; for which k = 0: 
For q > 3; k = 1, while for q < 3; k; = —1. In this case we also see that for a given 
value of H the critical value of the matter density is p, = 3H 2. If the matter of the 
universe has this density, we have k = 0, and spacetime is conformally flat. If it has 


greater density, we have k > 1, and space is closed. 


Exercise IXC4.2 
Solve for 2 and q in terms of just a,b,k, and ¢ (but no time derivatives). In 


particular, show 


b=0 => N=2%M=(1-#¢)" 
é4=0 = M=g=(- 9)" 


5. Schwarzschild 


All gravitational experiments outside of cosmology are based on the “Schwarz- 
schild solution”, which describes spherical symmetry outside the region with matter. 
Assuming also time independence, which is a consequence of spherical symmetry 


(Birkhoft’s theorem), we look for a metric of the form 


—ds* = —A-?(r)dé? + B-?(r)dr? + r?(d6? + sin?6 dd?) 
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(Other coordinate choices are possible, e.g. —A~*(r)dt? + B-?(r)[dr? + r?(d6? + 
sin? dd*)]|.) The first step is the choice of a vierbein: The simplest choice following 


from this metric is 


1 


1 
= Ad, = Do,. = -0p, = — 70), 
. ae, = a ea ee 


(This can also be used as a starting point in place of the metric.) The next step is to 
find the commutators of the e’s, which tells us what w terms the V’s must have to 
cancel these cgp°’s (vanishing torsion): 

[eo, eg] = —r~tcot 6 eg Vo has Mog V, = Bo, 

[e,, er] = B(In A)’e: Vi has Mi, Vi. = Ad; + aMi, 

lenee| = —r—! Beg = Vo has Mo = Vo =r te + BMyo 

ler, eg] = —r Beg Vo has Mro Vo =(r sin 0)-'0g + yMre + 6Moe 


where a, 3, and 7 depend only on r, while 6 depends also on 6. (Their explicit forms 


are already clear at this point, but we'll collect the results below.) 


We can now determine these Lorentz connections and compute the curvatures by 
calculating the V commutators. Since we now use explicit functions for the vierbein 
and connections, we use the method described in subsection IX A2 for this situation: 


Using the identities 
[Mis, Va] = No2Vi, [Mio, Mp3] = 122M)3 
[Vi, V2] = [ex + W1, €2 + wo] 
= {[e1, eo] + (e1w2) Mo — (e2w1)Mi} + {w1[Mi, V2] — wel Mo, Vi] — wiwe[Mi, M2]} 
we then find: 
[Vi, Ve] =—08Mig = Reow = —a8 
[Vi Vel =—ayMig => Ristg = —Or’Y 
[Vi, Ve] = —B(In A)'e: — Ba'My, +aVi = [a— B(In A)'Je: + (a? — Ba’) Mey 
=> a=Bi(in A), Rirtp = 02 — Bol 
B B 
[Vr, Ve] = ——e9 + BEM + BVe =(8- = eo + (6? + BG’) Myo 
B 
=> B = 4 Ryore = —(3? + Bp’) 
B 
[Mes Vol = — 7&3 + By Mr + Bo’ Mog + V6 


B 
= (y— Jeg + (7° + By’) Mrg + (75 + BO!) Mog 
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B 
> Y= 5, Rang = ~(Y + By), — Regog = —(76 + BO’) 


t 0 1 
[Vo, Vol = a= Eg + (060) Mos + OV + ByMoe = BOM,¢ 
t 0 1 
= (6 es + (7 — B)bMrg + (5 + By + =a)6) Mog 
tO 1 
a — » Ragas =—(8° + By + 095), Rrgag = 0 


Collecting the results: 


— -1 
V, = AQ + B(In AY My, Rey = BA[B(A~)'! 


B?2 
Vr= BO, Rote = Rises = (in A) 
Vo= 96 = 7 Mere Reoro = Rrdrd ——— 
1 cot @ B 2 
= a) Mog + —M, L=B 
% > sin 0 we r apm re Reeee = re 


Exercise IXC5.1 
Find the covariant derivative for the 2-sphere in spherical coordinates 


ds? = dé? + sin?6 d¢” 


in terms of the single SO(2) generator May = €a»M by the above methods (and 
not that of exercises IXA5.3 nor IXA7.5). Calculate the curvature. Find the 

three Killing vectors. (Hint: What is the symmetry of the sphere?) 
A simpler method of finding covariant derivatives and curvatures in this case is 
the Weyl scale method of subsection IXA7. (We already applied this method to the 
much simpler example of cosmology in subsection IXC3.) We start with the trivial 


covariant derivatives for the 2D metric 
=is = <dP 4dr? 


which are just partial derivatives (zero curvature). Then we make the coordinate 


transformation 


dr — 


(explicit integration of this expression isn’t needed in either the metric or the covariant 
derivatives), which modifies one of the covariant derivatives, 
2 


A B 
ais 4 =<d/ + ae Vr When Viv On 
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while the curvature still vanishes. (We have only chosen non-Cartesian coordinates 


for flat space.) Next, we make the scale transformation 
@=ra 


to obtain the metric 
—ds? = —(rA)*dt? + (rB) dr? 


Applying the general formula 

ds? =@"d? > Vi=6V,+(V°S) Mas, 
we find B 

Vr —= TD Ops Vi —_ r AO; + GTA) Mer 

Since the space is only 2D, the general equation 

Rip! = FP Rav + GEV VID — 56,55 (VO)? 
simplifies to 

Rie” = 5R66,69, 3h’ =6(5R+O0 In) 
so at this stage we have 


2 [Ban ray] 


Any 2D space can be expressed as a scale transformation of flat space locally, es- 


3R— (rA) 


sentially because the curvature has only one component. (Globally this is not true, 
since the integral of the curvature, which is scale invariant in D=2, is different for 
different topologies. This is related to the fact that for nontrivial topologies more 
than one coordinate patch is needed; the missing part of the integral can be hidden 


in the boundaries of the patches: see exercise IXA7.6.) 


Now we should repeat this procedure for @ and @ to get the covariant derivatives 
for the (2-)sphere, but this has already been done earlier. Besides, we don’t need 
those expressions explicitly, since spherical symmetry means they vanish on anything, 
and we already know the curvature of a sphere. (So, we can also avoid choosing a 
coordinate system for the sphere.) Thus we can immediately take the direct product 


of the sphere with the above 2D space, and make the final scale transformation 
p= 


The result for the final covariant derivatives is 
B 


GA Mer, Vi = ly, ni 1BMir 


V, = BOo,, Vi= AGt+ 
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where V; are the covariant derivatives for the sphere, in agreement with the previous 
method. The curvatures are also easy to find: Besides the V,@® we needed for the 
covariant derivatives, the only second-order derivatives we need are V2h and V2. 
(V,@ vanishes, but V?® is nonvanishing because of the M;, connection term in V;, 
converting V,@ into V,®.) Thus we need to evaluate 

Ry = Ohl 1 — (V4) 

Ri = 8AV eV") — 64 (V,4)| 
RU kl ol! 
Rij = Sf Oj 219R + (V2 — V2yIn =| 

where 7’ = (t,r), and the V’s and R on the right refer to the 2D t-r space just before 
or after the direct product. The result also reproduces the previous. The final result 
for Rip can be obtained even more simply by noting that it agrees with what we 
would have obtained by a single scaling for the 2D space —ds? = —A~2dt? + B~?dr?, 


because of the triviality of the 6 and @ derivatives. 


Having all the curvatures, we can now calculate the Ricci tensor, which appears 
in the field equations. The nonvanishing components are: 


Ry = Rererp + 2Rigt9, Ree = —Rertr + 2Rrore, Rog = Roe = —PRiote + Rrore + Rogod 


Vanishing of R,y— snap is equivalent to vanishing of R,». In terms of these curvatures, 


we see it also implies 
—Rertr = 2Rioig = —2Rrorg = Rogog 
These are easy to solve: First, 
Ree =—Rrr = (nA =-(n BY => A=B! 


where we have fixed the proportionality constant by requiring A,B — 1 as r — co 


(redefining t by a constant scale transformation). Also, 


i k 
(-B’?) => 1-B=- 


—2R6ro => Rosas > (1 = B?)' => a : 


ke 
=> B= ,4/1-- 
f 
for some constant k. The last field equation is then redundant. (As usual, the field 


equations are related by the Bianchi identity.) The constant k can be related to the 
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nonrelativistic result by comparing at large distances. (See exercise [IXB1.1.) We 
then find k = 2G'M, so the final result is: 


IGM IGM\—1 
a (1 _ —) ge (1 _ —) dr? + r2(d6? + sin26 dé?) 
r 


Exercise IXC5.2 
Repeat this Weyl scale derivation of covariant derivatives and curvatures for 
the Schwarzschild metric in dimensions D>4. (Hint: Do not use explicit 


expressions for the covariant derivatives of the higher-dimensional sphere.) 
Solve for A and B. 


More generally, if we have some spherically symmetric, static matter distribu- 
tion, then the only nonvanishing components of the energy-momentum tensor will be 
Tu, Tr, and Tyg = Tyg (representing energy density, radial pressure, and isotropic 


pressure), all functions of just r. Repeating the above procedure, we integrate 
r 
[r(1 — B?)]' = 2r°Th,, [In(AB)|' = ~ ap Tt + Thr) 


while the remaining equation is redundant. 


Exercise IXC5.3 
Use the local conservation law for energy-momentum to determine T%g in 


terms of T}, and T,,,. 


For example, for a spherically symmetric, static electromagnetic field the only 
nonvanishing components of the field strength are Fj, and Fogg, corresponding to 
electric and magnetic charges, respectively. Then the invariance of TJ under a duality 


transformation (see subsections ITA7, IIIA4) implies 
yale = eat. = ASB” 


again, since on this F,, duality effectively replaces (0,¢) < (it,r), with the i from 


Wick rotation. Local scale invariance (see subsection IXA7) then tells us 


To =0 => Te = —Trr = Too = Top 


Exercise IXC5.4 


Let’s rederive these results by brute force: 


a Derive T,, for a general electromagnetic field by varying its action with respect 


Loe,” org, 
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b Find each of T’s components explicitly in terms of F;, and Fog in the case 
where those are the only nonvanishing components, and show they appear 


only in the combination F? + Fy: 


As usual, these field strengths can be found easily from the integral form of Gauss’ 


law by integrating over a sphere: For example, for the magnetic field 
magnetic charge ~ 5 fava" fon = Arr? Fog 


for the Fg, component of Fy (integrating over 6 and ¢), since the metric (and vierbein) 
for 6 and ¢ is the same as for flat space. By duality, the solution for Fj, in terms of 


the electric charge is the same. The result is 
TH = 7 Q= ce a 9”) 


for electric charge e and magnetic charge g. The 1/r* dependence also follows from 
scale invariance, since the charges are dimensionless (and the matter field equations 
decouple from A and B). (Again, since the solution does not extend to r = 0, we 
normalize by comparing F,, or Ty» at r = oo to the flat-space solution.) The net 
effect on the Schwarzschild metric is 

_2GM _, _2GM 2Q 


r r r2 


1 


Our solution relates to the usual mechanics normalization of the charges (see subsec- 
tion VIIA3), restoring G, as 


2Q = G2n(e? + g”) = G(e?, + g2,) 


Exercise IXC5.5 
Let’s also apply brute force to solving Maxwell’s equations V,F@ = V (alte) = 
0 (outside the matter). 


a As a warm-up, using directly the above covariant derivatives, show that in 
flat space 
Yeas & Ver ar er. 


Note that the covariant derivative of a vanishing component doesn’t necessar- 
ily vanish (just as the ordinary derivative of a function that vanishes at some 
point doesn’t necessarily vanish at that point): Components of V other than 


V, contain Lorentz generators that rotate other components of V to V,. 


b Solve Maxwell’s equations in differential form for Fj, in the above case. Use 


the empty-space solution to define the normalization at infinity. (Actually, 
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in this case, the charge is well-defined in terms of the flux of the fields, as 
described above, but gives the same result here because the space is asymp- 
totically flat.) 

Exercise IXC5.6 
Spherically symmetric solutions can also be written in Eddington-Finkelstein 
coordinates as 


—ds* = —dt* + dr? + r°(d6? + sin?@ dé”) + H(r)(dt + dr)? 


where H = 1— B? = G[2M/r — 2n(e? + g’)/r?] in terms of the above results, 
and is linear in G. (In these coordinates, gravity looks Abelian for this so- 
lution.) Note that this form (and its Abelian nature) closely resembles the 
general wave solutions of exercise IXC1.2b (but the “Cartesian” coordinate 
zx! is now replaced with r). 

a Obtain this form from the above forms (where A = B™') by a coordinate 
transformation. (Hint: The angular term didn’t change.) 

b Find V directly from this form of the metric. (Note: It might differ from the 
previous by not only coordinate but also local Lorentz transformations.) 

Exercise IXC5.7 

Consider the plane wave in the coordinates 


_ ds? = —2detde- + D(07) (29 ay? 4 72800) dz?) 


Calculate the covariant derivatives and curvature tensor by the first method of 
this subsection (double-counting and subtracting, not the Weyl scale method). 
Show that the field equations reduce to 


L" + (8")L=0 


Exercise IXC5.8 
Use the first method of this subsection to calculate the covariant derivative 
and curvature tensor for the metric 


—ds” = —dt? + 2e"dt dy — 5e”°dy” + da? + dz? 


Show that this metric satisfies the field equations with a cosmological term 
for a dust at rest with respect to this time coordinate; i.e. 
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where A and p are both constants. 


Exercise IXC5.9 
Use this method to calculate the covariant derivative and curvature tensor for 


the cylindrically symmetric metric 
—ds* = —A~?(r)dt? + B-?(r)dr? + r7d6? + dz” 
Assume the matter in this problem is a “perfect fluid”, 
T® = putu’ + P(n® + utu) (u? = -1) 


Solve the equations of motion for the gravitational field to find A and B, as 
well as the pressure P and particle density p. What is the implied relation 
between P and p? 


Exercise IXC5.10 
Use this method to calculate the covariant derivative and curvature tensor for 


the following metric, corresponding to that outside a planar mass distribution: 
—ds* = —A-*(z)dt® + B-?(z)(dx? + dy’) + dz” 


Solve Einstein’s equations in empty space to find A and B (up to some con- 


stants of integration). 


6. Experiments 


When comparing to the real world, it is useful to know some astrophysical radii: 
1) Earth’s orbit (1 AU): 1.5 x108 km 
2) Solar radius: 7 x10° km 
4) Solar gravitational (Schwarzschild) radius (2G.M/c?): 3 km 


(1) 
(2) 
(3) Earth radius: 6000 km 
(4) 
(5) 


Earth gravitational radius: 0.9 cm (1 shoe size). 


To see how these fit in with other physical criteria, consider the following diagram of 


mass vs. radius (in natural/Planck units) for various physical objects: 
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log, radius 


log, p Mass 
-60 -40 -20 0 20 40 60 


Particles have the Compton radius R = h/Mc according to quantum mechanics, black 
holes (see subsection IXC7) have the Schwarzschild radius R = 2GM/c”. Condensed 
matter (solids and liquids) is basically atoms packed together, and has the same den- 
sity regardless of size, up to an order of magnitude or so. The size of an atom is about 
the same as the Compton radius of an electron, up to a factor of the fine structure 
constant, while stars are more or less condensed matter near their gravitational radii, 
up to a few orders of magnitude. So, known objects tend to lie near the lines drawn 
above, to within a few orders of magnitude (perhaps related to the fine structure con- 
stant a 1/137 or the proton-electron mass ratio + 1836), which is close compared 


to the tens of orders of magnitude that set the scale of the diagram. 


Exercise IXC6.1 


Consider the following very crude approximations to various types of stars: 


a Assume a star has the density of a neutron, i.e., of a sphere with the mass 
M of the neutron and radius equal to the Compton radius h/Mc. Assume 
also that the radius of this (spherical) star is equal to its gravitational radius. 
(This is roughly a “neutron star”.) Find the mass and radius, in terms of 
both physical constants and conventional units. Note the appearance of the 


large dimensionless number, the ratio of the Planck mass to the neutron mass. 


b Assume a star has the density of a “compressed” hydrogen atom, a sphere 


with the mass of the hydrogen atom (which we can take as roughly equal to 
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the neutron mass) and radius equal to the Compton radius of the electron, 
h/me for electron mass m. Assume the mass of this star is equal to that of 
the neutron star found in the previous example. (This is roughly a “white 
dwarf’.) Find the radius, again in terms of both physical constants and 


conventional units. 


c Assume the same mass again, but now assume the density of an ordinary 
hydrogen atom, which has the Bohr radius h/mca. (This is roughly an “or- 


dinary” star.) Compare to the mass and radius of the Sun. 


All experiments (excluding cosmology) are based on the Schwarzschild metric. 
The first type of experiment involves gravitational redshift, but unlike the cosmolog- 
ical case, the relevant reference frames of observation are not local inertial frames 
but the static reference frame in which the Schwarzschild metric is defined. (There 
are also measurements of redshift from airplanes, whose reference frame is defined 
with respect to the Schwarzschild one.) In this reference frame the relevant Killing 
vector is the one which expresses the fact that the space is static, K'0,, = 0/0t. The 
momentum which is measured by the observer is p*, not p” or pm, since the observer 
still uses a reference frame for which the metric at his position is flat (but not its first 
derivative, since he is not in free fall). (In fact, this is one of the purposes for using 
a vierbein, as a frame of reference.) The conserved quantity is then 


2GM « 


E=—K,p = 4/1 E 


r 


where the energy of a particle E is the time component of p* as measured in this 
frame. Thus, conservation of E for a photon gives the r-dependence of the observed 
energy E (and thus the frequency, which in turn determines the wavelength, since 
a =); 

To compare with nonrelativistic mechanics, we instead evaluate E’ for a massive 


particle in the Newtonian limit: 


GMm 
r 


M 
px (1-) (m+K)xm+K — 


giving the “conserved energy” F in terms of the “particle energy” E (rest mass m + 


kinetic A’), including the potential energy. 


The other type of experiment involves properties of geodesics, so we need to 
solve the geodesic equations of motion. Without loss of generality, we can choose the 
angular coordinates such that the initial position and direction of the particle is in the 


equatorial plane 6 = 7/2, where it remains because of the symmetry 6 < 7 — 6, as in 
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the nonrelativistic case. Also as in the nonrelativistic case, we can find constants of 
the motion corresponding to the energy EF and (z-component of) angular momentum 
L by using the Killing vectors K0,, = 0/Ot and 0/0¢ to find the conserved quantities 
K pg = K™ Gmnt” (in the parametrization v = 1): 


B= -Gimt™ = (1 — ) i, L=Gent” = rd 


r 


In the case where the particles come from infinity, these are the initial kinetic energy 


and angular momentum. We have chosen an affine parametrization, which requires 
2GM\ , 2GM\~* . 
=m? = Gane —— (1 _ a iy + (1 = a) 7 + ro 
- ¢ 


Solving the previous equations for ¢ and ¢, this reduces to the radial equation 


IGM i? 
0O=—-B74+ 74 (.-=) (S +n?) 
r r 


. H+ ( GMm2 1? out’) 
= — —. 


2 : 2 as 5(E* — m’) 

This looks like a typical nonrelativistic Hamiltonian for “energy” $(E? — m?) 
with the same terms as in the Newtonian case but with an extra r~? term. (To 
take the nonrelativistic limit for the massive case, first scale the affine parameter 
T — s/m.) Since there are good coordinate systems for a “black hole” using r as a 
coordinate (e.g., see the following subsection: r and r” + t”, as seen from the figure 
for Kruskal-Szkeres), this equation can even be used to descibe a fall into a black 
hole. (For example, for L = 0 we get the same cycloid solution as in cosmology and 


in Newtonian gravity, reaching the singularity at r = 0 in finite proper time.) 


Because of the r~® term in the potential, noncircular orbits are no longer closed. 
In particular, let’s consider orbits which are close to circular. Circular orbits are 


found by minimizing the potential for the r-equation: 


dV GMn? IL? 3GML’ 
0=—= peng ae este 


dr r2 r3 rt 
me d?V 7 IGM m? i SL* 19GMIir 
dr2 r3 rt Pr 


The near-circular orbits are described by small (harmonic) oscillations about this 


minimum, with angular frequency given by 


te PV _ GMm?(r — 6GM) 
tO dr2 r(r —3GM) 
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from solving for L? = GMm?r?/(r —3GM). On the other hand, the frequency of the 
circular orbit itself in terms of its angular dependence is just ¢ = L /r?, giving 
i GMm? 
Ww. = = 
% 72(r — 3GM) 
This means that the perihelion (closest approach to the Sun) of an orbit, which occurs 


every period 27/w, of the radial motion, results in the change of angle 


Baie 2 GM\—'? 
an + 56= | are = Su, = 20 (1-0) 
0 dT Wp ii 


GM 
=> do 61 — 
ri 


in the weak-field approximation. This effect contributes to the measurement of the 
precession of the perihelion of the (elliptical) orbit of Mercury, but so do the precession 
of Earth’s axis, the oblateness of the Sun, and gravitational interaction with other 
planets. As a result, this relativistic effect contributes less than 1% to the observed 


precession. In particular, the solar oblateness is difficult to measure. 


The effects on geodesics of photons are much easier to measure, since there are 
no Newtonian effects. As a result, the weak field approximation is sufficient. We first 
consider bending of light by the Sun: A photon comes in from infinity and goes back 
out to infinity (actually to the Earth, which we assume is much farther from the Sun 
than the photon’s closest approach to it), and we measure what angle its trajectory 
was bent by. (For example, we look at the apprarent change of position in stars when 
the Sun passes in their direction during an eclipse.) Starting with the exact solution 


for a photon’s geodesic (case m? = 0 above), we use the equations for 7 and ¢ to find 


d | E2 
a = Br 7 +2GMr 


Changing variables, 


b L G du 
=-, b=, = —_- > = d>————_ 
= me es --\ aoae 


The impact parameter b = L/E would be the closest approach to the Sun neglecting 
gravitational effects (L = rp = bE). We now make the weak field approximation: For 


a small, 
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Defining ¢ = 0 at r = ov, the integral is 
(1 — cos x)? 
om xa 
cos X 


The ends of the path (r = oo) are at exactly 
x¥=0,7 => ¢=0, 74+4a 


Therefore the deviation of ¢ from a straight line is 4GME/L. (Mathematical note: All 
variable changes were those suggested by the flat space case a = 0: E.g., b/r = sin x, 


where y is what ¢ would be in flat space.) 
A similar experiment involves measuring the round-trip travel time for radio waves 


from Earth to some reflector (on another planet or an artificial solar satellite), with 


and without the Sun near the path of the waves. Now, instead of dr/d@ we want, in 


units b= 1 
dr = 2a 1 1 mn 2a 
dt r p23 
rdr dr 
> dts + 2a 


2 —————— 
fal ea 
=d vr? —1+2a cosh™'r| 


=¢ | vr? = + 2GM cosh~'—| 


putting the b’s back. The first term (which is actually bigger) is the nongravitational 
piece (so we examine only the rest); it is the length of the side of a triangle whose 
other side has length b and whose hypotenuse has length r. We have neglected the a 
correction inside the square root in the original, exact expression: It can be estimated 
by (1) noting the argument of the square root is exactly 0 at rmin, and (2) looking 
at d(r,/...), and noting its deviation from the exact result goes as a/ r® times the 
usual, which is less than a/b’, giving a contribution of order 2GMb/rmaz, and thus 
negligible. 

For simplicity we assume both orbits are approximately circular, so rgarin and 
Treflector ave fixed (at least for the duration of the experiment); the change in b then 
comes from those radii differing from each other, so they revolve around the sun at 
different rates. We then integrate from r = rmin © 6 to r = rgarth, add the integral 
from T = Tmin tO T = T reflector, multiply by 2 for the round trip, and throw in a factor 
to convert to the proper time s of the observer (which turns out to have a negligible 
effect to this order in a). This result is then compared to the same measurement 


when both observer and reflector have revolved further about the Sun, so b changes 
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significantly (but not rgarth NOY Treflector). For « >> 1, coshtx & In(2x), so for 


b < lL Earth and Treflector WE find 


As % —8GMA(In b) 


7. Black holes 


For physical massive bodies the Schwarzschild solution applies only outside the 
body, where 7T,, = 0. The form of the solution inside the body depends on the 
distribution of matter, which is determined by its dynamics. Generally the surface of 
the body is at r >> GM, but we can try to find a solution corresponding to a point 
mass by extending the coordinates as far as possible, till the curvature components 
Rw“ blow up. The Schwarzschild metric is singular at r = 2GM. In fact, r and 
t switch their roles as space and time coordinates there. There is no corresponding 
singularity there in the curvatures, which are ~ r~*. This unphysical singularity can 


be eliminated by first making the coordinate transformation, for r > 2GM, 


, 29GM\* r 
f= far(i-=) =r+2GM In(=--1) 


and then making a second coordinate transformation by rescaling the “lightcone” 


coordinates as 


M4 saMel’*0/MGM _ sai ao 1 e(t$)/4GM 
‘i e€ 3GAr e€ 


The result is the “Kruskal-Szekeres (‘Sack-er-ash’) coordinates” 


2GM 
—ds? = G ee (eage + dr") + 1r?(d? 4 sin26 do”) 
r 


where r(r”, t’”) is defined by 


M2 _ 42 _ (4 M)? ( r -1) r/2GM 
r t (4GM) 5GM € 


This can now be extended past r = 2G'M down to the physical singularity at r = 0. 


The complete space now looks like (plotting just r” and t”): 
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t" 


In this diagram lines at 45° to the axes represent radial lightlike geodesics. Since 
nothing travels faster than light, this indicates the allowed paths of physical objects. 
Curves of fixed r are hyperbolas: In particular, the physical singularity is the curve 
v2 — 7"? = (4GM)? (r = 0), while t”? — r”? = 0 (r = 2GM) is the “event hori- 
zon” which allows things to go only one way (out from the bottom half or into the 
top half), and r = oo is both r” = +oo. Nothing can communicate between the 2 
“outside worlds” of the left and right 90° wedges. In particular, a star which col- 
lapses (“gravitational collapse”) inside its “gravitational radius” 2G'M is crushed to 
a singularity, and the spherically symmetric approximation to this collapse must be 
represented by part of the Kruskal-Szekeres solution (outside the star) by Birkhoff’s 
theorem, patched to another solution inside the star representing the contribution 
of the matter (energy) there to the field equations. This means using just the top 
and right 90° wedges, with parts near the left edge of this modified appropriately. 
The top wedge is called a “black hole”. (If a situation should exist described by just 
the bottom and right wedges, the bottom wedge would be called a “white hole”.) 
Similarly, stable stars are described by just the right wedge, patched to some interior 
solution. This right wedge represents the original Schwarzschild solution in the region 
r > 2GM where its coordinates are nonsingular. In that region lines of constant t 


are just “straight” radial lines in the Kruskal-Szekeres coordinate system (r” ~ t’). 


Besides the fact that nothing can get out, another interesting feature of the black 
hole is that an outside observer never sees something falling in actually reach the event 


horizon: Consider an observer at fixed r > 2GM using Schwarzschild coordinates, 
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so his proper time s ~ t. Then light radiating radially from an in-falling object is 
received later and later, up till t = oo, by the observer as the object approaches the 
event horizon, although it takes the object a finite amount of proper time to reach 


the event horizon and the physical singularity. 


Exercise IXC7.1 
Apply the methods of subsection IXC3 to the equations of motion in a 
Schwarzschild metric of subsection [IXC6 for a massive object falling straight 
into a black hole (angular momentum L = 0): Solve for r,7 in an appropriate 
parametrization to show that it takes a finite proper time to reach the event 
horizon from any finite r outside it. (Hint: You can also try using the gauge 


v =r instead of 1.) 
There are also more complicated black-hole solutions with spin and electric charge. 


Another interesting effect of the event horizon is the eventual decay of the black 
hole (“Hawking radiation”): Pair creation can result in a similar way to that in 
an electrostatic potential of sufficient strength (see exercise HIB5.1). Particles are 
emitted near the event horizon (the edge of the gravitational barrier), carrying energy 


off to infinity, while their antiparticles fall into the singularity. 


There are two features of the black hole that are less than desirable: the existence 
of singularities indicates a breakdown in the field equations, and the existence of event 
horizons results in an “information loss”. Both these properties might be avoidable 
quantum mechanically: For example, quantum effects can generate curvature-squared 
terms in the effective action, which modify the short-distance behavior of the theory. 
One might think that such short-distance effects would have an effect only at short 
distances away from regions of high curvature such as the singularity, and thus remove 
the singularities but not the event horizons. However, it is possible (and examples of 
such solutions have been given) that the prevention of the creation of the singularity 
in stellar collapse would eventually result in a reversal of the collapse (“gravitational 
bounce”): The would-be black hole solution is patched to a would-have-been white 
hole by short-distance modifications, resulting in an exploding star that initially re- 


sembled a black hole but has no true event horizon. 


Although “compact” bodies have been observed (e.g., at the center of our galaxy) 
with masses large enough to be black holes (i.e., too large to be neutron stars), their 
sizes have not been determined to be as small as their event horizons, although our 


present knowledge of astrophysics does not provide for an alternative explanation. 
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sd X. SUPERGRAVITY — 


In the previous chapter we studied the symmetry principles behind general rel- 
ativity; now we add supersymmetry to the picture. Supergravity is a fundamental 


part of many of the applications of supersymmetry. 


ipueaseiegeeees Fox BU PAGS cscs scvsiraceesnsesaconas 


We first need to understand the “geometry” associated with local supersymmetry. 


1. Covariant derivatives 


In subsection [VC3 we discussed superspace covariant derivatives for super Yang- 
Mills. Similar methods can be applied to supergravity, the theory of the graviton (spin 
2) and gravitino (spin 3/2). In that case we want to gauge the complete (unbroken) 
global symmetry of the theory: Besides the obvious Poincaré and supersymmetry, 
there is also the axial U(1) (“R”) symmetry that transforms the spin-3/2 field. (The 
best we might have expected is superconformal symmetry, which also has conformal 
boosts and scale, but which are broken by the vacuum just as in ordinary gravity, 
and S-supersymmetry, which is also broken because it’s the square root of conformal 
boosts.) It is introduced in the same way as local Lorentz invariance in ordinary 
gravity, and acts on flat spinor indices (but cancels on vector indices). We therefore 
want to gauge the translations Oj, (which have been generalized naturally to super- 
space from O,, appearing in ordinary gravity to include supersymmetry), the Lorentz 
generators Mag, M,, of ordinary gravity, and the (second-quantized) hermitian U(1) 


generator Y, defined to act on the covariant derivatives as 


[Y, Va] = —$Va, [Y, Va] = Va, ye Vval=o0 


v4 


We now use the “ ” to refer to hermitian conjugation without reordering, i.e., keep- 
ing the partial derivatives and other generators on the right. (As in ordinary gravity, 
transformations are not truly unitary, and covariant derivatives truly hermitian, be- 


cause of ordering.) 


Then the gauge parameter, covariant derivative, and field strengths are expanded 


over these generators, as in ordinary gravity: 
K = KMOy + 1K Mga + 4K%My, + iK AY 


Va = Eau + 324°" Mg + 3.24°9M, 5 + iAaY 
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[Va, Ve} = Taso Vo t+ $Rap’ M5, + 3Rap” Mj. +1 aby 


(E4™ is known as the “supervierbein” or “vielbein”.) Alternatively, we can write the 
M and Y terms collectively as 54? Mga (and similarly for the covariant derivative 
and field strengths), where M,z are the generators of OSp(3,1|4), by algebraically 
constraining K4? to contain just the appropriate pieces (and relating K® to K® in 
the usual way). Also, the shorthand K‘M, now includes Lorentz and U(1) terms. 


Exercise XA1.1 
Use the definition in the above commutation relations to express the torsion 
Tap® directly in terms of the structure functions C4g°, Lorentz connection 
4°7, and U(1) connection A4. 


The constraints in supergravity are a combination of the kinds used in ordinary 
gravity and super Yang-Mills: those that (1) define the vector derivative in terms of 
the spinor ones 

—1V 03 = {Va, Va} 


(2) define the spinor (Lorentz and R) connections 


—— BY) b 
fet, = Ty a3 = Thy = 0 


and (3) allow the existence of chiral (scalar) superfields 
V.2=0 = {Va Vele=0 (Y@ = 0) 


(The first two constraints imply the generalization of this chirality condition to Y 4 0 
and chiral superfields with undotted indices, like the Yang-Mills field strength.) 


Exercise XA1.2 


Rewrite the first and last set of constraints directly in terms of field strengths. 


The explicit solution of all these constraints is a bit messy, but we will need 
only a certain subset of them to find the prepotentials and supergravity action. The 
form of the solution is a generalization of super Yang-Mills in a way similar to how 
general relativity generalizes ordinary Yang-Mills. In particular, just as the vierbein 
Cg = €q'"Om is a generalization of the Yang-Mills vector A, to describe gauging of the 
translations, the generalization of the super Yang-Mills prepotential V to supergravity 
is H = H™(—i)0,, which appears in an exponential e” just as V appears as e”’: The 


chirality-preserving constraints, expressed explicitly in terms of the vielbein, is 


{ Ea, Eg} = Cag’ Ey 
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If the commutator vanished like the Yang-Mills case, E,, would be partial derivatives 
on some complex two-dimensional subspace, the usual 0, up to some complex (su- 
per)coordinate transformation, as for d, in flat superspace. However, the fact that 
their algebra still closes means they still generate translations within such a subspace, 


and are thus linear combinations of such partial derivatives: 
Ey= UNE, E,=e de®, 2=2™(-i)du 


where we have separated the matrix coefficient into a local complex scale (scale @ 
U(1)) w and a local Lorentz transformation N,“. For most purposes we will find it 


convenient to fix all these invariances by choosing the gauge 
~Nwt =e => E,=E, 


As for Yang-Mills, solution of the chirality condition introduces a new, chiral 


gauge invariance: 
e@ = eiMe%e*K, A= AM(-i)Oy, K = K™(-iOy 
[O;, Al ~ Oy => O,A™ = 0, AM = 0 


where A is not chiral, since it generates terms in the transformation law of E,, that 
can be canceled by including 0, A” terms in the transformation law of WN". This 


means we can use A” and K” to gauge 
QM=2F=-0 => N=N"(-i)dn => E,=4,+ LE," 
where pe = —10,02" +... from expanding the exponentials as multiple commutators. 


We can again transform to a chiral representation, and work in terms of 
5 n Oe 3 A A _ 
eae. eae a E;, = 0; Bi =e" Oye" 


where U now generalizes the constant (U) = 6“0"(—i)9,,, used in flat superspace. Also 
as for Yang-Mills, the usual local component transformations (now for coordinate, 
supersymmetry, scale, U(1), and S-supersymmetry) reappear in the chiral parameters 
A™ and A+. 
For a component analysis, we look at the linearized transformation (see exercise 
IVC4.3) 
6U™ = i(A — A)™ —ig[(U), A+ Al” 


= i(A— A)™ — 40”6"0,;,(A + A)™ + (04 A# — GA AY) + 13 (0?0"0, 2AM — 076" a, AP) 
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where for A we use as independent just the chiral parameters A” and A“; the nonchiral 
A", having already been used to gauge away U“, is now fixed in terms of the others 
as 

A® = @ YU Abe¥ 
to maintain U“ = 0. The first term in the transformation tells us that the surviving 


component fields are the same as for super Yang-Mills, with “m” as the group index: 
U™ = €4™ (89), ba” (878), va (00), A™ (676) 


The second term in the transformation law gives de," ~ —0,\"™ from A™| = A™| = 


A\™. Then A“ contains the rest of the gauge parameters: 


AY = ea + ib(8), A,“(0), C4(07) 


= supersymmetry, scale +iU(1), Lorentz, S-supersymmetry 


The third term in the transformation law then shows scale and Lorentz gauge away 
pieces of the vierbein, as usual, while S-supersymmetry gauges away the trace of w,”. 
It also forces A” to include é at order @ to maintain the gauge; we then see that 
Wo" is the gauge field for supersymmetry, with contributions from the second and 
fourth terms. Finally, the fourth term also shows that A™ is the gauge field for U(1). 
The resulting component content is that of “conformal supergravity”, which will be 


transformed later to ordinary supergravity through a compensator superfield. 


For perturbation theory, or comparison with global supersymmetry, we should 
expand about “flat” superspace (which is nontrivial because of nonvanishing torsion 


fe 3° in empty superspace). We then modify the chiral representation: 
e%e2% = UY = eU/%eH U)/2 sg Ey = dz, By = ede 


We now expand all derivatives over the covariant derivatives dy, of global supersym- 


metry (constructed from (U) as before) 
H=H™(-i)dy, Ea, =Ea™@dy 


instead of over partial derivatives O,,, which is just a change of basis. This also 


modifies the description of the A gauge parameters: 


ef = Mea A= AM (—-i)dy, = K™ (-i)dy 


[dx, A) ~d, = Ady +A™On = 4{d", [d;, L'd,]} 
> AM=PLY, Ath = idk Le 
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in terms of a new parameter L”. From this we find the linearized transformation law 
SHEE w& dELE — dEL# 


Exercise XA1.3 
Expand this transformation law in components, and compare with the previ- 


ous analysis. 


Besides chirality, we also need a certain combination of the other constraints: 
(=7,° = TAP = (l'Cs3" $C,” =7Ay 


where the A term comes from the contribution yi AS, to T. ce (See exercise XA1.1.) 
Using the gauge Ey = E,, without loss of generality, we then find (comparing similar 


manipulations in subsection [IXA2) 
-iA, = EdyEE,™ = EEE 


where the backwards arrow on E, = E,™ 0), means all derivatives act on everything 
to the left (see subsection IA2), and 


E = sdet E,™ 


(The superdeterminant was defined in subsection IIC3.) 


We now need the general identity, for any function A and first-order differential 


operator B, 


— — — 


Ae® = (1- Be) Ac? =(1- eB) (eB Ae?) = (13 eB) (e® Ae~®) 
= (1-e?)(e? A) 


— — — 


=> 1=(1-e?)e? = (1-e?)[e7(1-e77)| 


The final result in the gauge w = 1 (N," is trivially restored) is then 


This can be used to solve chirality conditions on matter fields: In this gauge, we 
have 
YO=y > 0=VO= (Ext iyAg)® 


= =e}, 0,6 =0 
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Again as for Yang-Mills, the chiral-representation field @ transforms under only the 


A transformations: 
b = (1-e!4)%e'¢, A= —i(A" Om + A#O,) 


Thus, scalars in the real representation become densities in the chiral representation 


(except for y = 0). In particular, we have for the special case 
gol. = ¢agde* = 5 f de a6. =0 


which will prove useful later for chiral integration. For now, we note that such a 
chiral scalar, with y 4 0, can be seen to compensate from the 0,,A“ term: This term 
allows U™ to eat the complex “physical” scalar and spinor, fixing scale, U(1), and 
S-supersymmetry, while the complex auxiliary scalar survives, along with coordinate, 
Lorentz, and supersymmetry invariance. We also note that for perturbation about 


flat superspace we have 
i(1- A) = a,A™ — dA" = —d?d,,L* 


Exercise XA1.4 
Show that preservation of the chirality of ¢@ implies the previous chirality con- 
ditions on A. Thus, as for nonsupersymmetric or nongravitational theories, 
the gauge group follows more simply from starting with matter representa- 


tions. 


We also note that in the gauge w = 1, and also Q4 = Q# = 0, the superdetermi- 


nant is simply (see subsection IIC3) 
EI = det(Em") 


where E,,7 is a component of Ey,“ (not the inverse of E,"”). 


2. Field strengths 


These constraints can be completely solved for all the field strengths. Alterna- 
tively, we can impose them, together with the Bianchi identities (Jacobi identities 
of the covariant derivatives), to find a smaller set of algebraically independent field 
strengths, and the differential equations that relate them. The method is analogous 
to the case of super Yang-Mills treated in subsection IVC3. We begin with the con- 


straints analogous to the Yang-Mills ones: 


{Vis V a} = —tV og LY as Va} = Roe’ Mz 
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(where the latter will simplify from later results). From just the latter, we find 
[Via {Vs Vy}]=0 =  Reapy)’ — 195(a’ Fay) = Viaayy’ = 0 
Using both constraints, we also have 
[Via, {Va), Vs}] + [V5, {Va, Va}] = 0 
= [Vea V5] = tRapy"V5 — 3 FapV5 — (V5 Rag") My 
= [Va Vai] = ~iCapWy + fiRagi'V5 — tFanV5 — 31(VyRap!) Mr 


for some operator Ws = W:AV At Ww:! My. So far the exercise has been analogous 
to the super Yang-Mills case (where the extra “i” in the definition of W is due to 
our use of antihermitian generators, except for Y). Now we impose the remaining 


constraints, which can be combined conveniently as 
i, ee sa ee 
0 = Tae? = —iCagWy? => Wa = WalVp+WalV5 + Wa) M1 


Following again the steps for Yang-Mills, we analyze the next-higher-dimension 
Jacobis, beginning with 


0 = {Vas [Va), Vig]} + [V 45, {Var Vat] = tCya{Ve),W3} + Aas 


A 71(Viakg gs) Ve — 3(V Fay) V5 — 54(ViaV 3 Rayy) Mr 
+ FRias Wane + fF aV g3 — 34(V 3 Ry(0p)’)V5 — 3(V 3 F(a) Ve) 


+ (V_3Rop")Mr — Ropy Vey — Ragy' V8 


yao — 


By inspection, or applying the previous Jacobis, we see 


A i= 0 = A C64 A.s= aoe: 


(ya) ad 6)5? aB 3 B 


automatically, so the only new information comes from the trace of this Jacobi, 
{Va,W3} =tA,s 

Evaluating {V,W} in terms of its pieces, we find 

Rea” = Fog = 9, ig at 0058)B, Wa" = — Bo, 

We = VaB + VWaza, WwW?) = iv ey,” 
Wif=V.w:"= WwW. = —60(W.® + 1579) .) B 
VaWs = VaW;" = 0, VaW;" = 6g/(Ws + 31V JB 

where W, is the Y part of W, (= W,tY +...). 
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Exercise XA2.1 
Show that 


{Vo,Ve}=BMis, ViaVeVy=0 = V.(V? +B) =—-5ZBV? Moa 


and thus V +B gives a chiral superfield when acting on any superfield without 
dotted indices. 


For the other Jacobi of this dimension, we have 
0 = [Vaa, {Va, V4}] + {Va, [V4 Vaal} + (V4, Ve, Vaal} 
= =i[Vaas Vas] — i Ve, CyaWa + H(VaBY Mya} - fF 5, CooWa + (a8) Mpa} 
= —1C34| fag — {Va, Wa} + Ly B) Mag —h.c. 
= fas = {Via Wa}—3(V B)Mas, — {V%, Wa} +{V", Wa} =0 


(Here “h.c.” means “hermitian conjugate” without the reordering, which would gen- 
erate non-operator terms.) 
Evaluating {V, WV} in terms of its pieces, and combining with the results of the 
previous Jacobi, we obtain the final result: 
{Va, Va} = BM 4% {Va; Va} =— V ap 
[Vio iV 3] = CasWe — 3(VeB)M a2, — [—i Vad —*V gal = Cag fap — h.c 


Wa = — BVa— GaPV 5 + 3(V?Ga1)M gy + 3Wa Mag + WaY + i2W? Mpa 


fos = 13G(a'V ayy — 3(V(aB + 13 Wa) Va) + Wap V1 — 3(V (eG) )V5 
— (3V°B + BB+ FiV'W,) Mog — tigl(V eG y3) Ma? +a 8] 
+ 3Wop? Ms, + 3(VaV'Gay ys, + t9(V(aWgy)¥ 
Wapys = _ avi aW 46) 


The “reduced tensors” B, Go, Wa, Wagy satisfy the “reduced Bianchi identities” 


V"'Goa = VaB —iWa 


Note that B or W, may vanish in certain gauges, for reasons to be explained in 
subsection XA4. 


Exercise XA2.2 


In IXA4 we saw that integrals of total covariant derivatives vanished in curved 
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space by virtue of the identity T,,? = 0. Show that these torsions satisfy the 
superpace generalization 
(—1)" Tap” =O 


Exercise XA2.3 
Using the expression for €g5-q in terms of spinor indices from subsection ITA5, 
show 

Thed = G€abed 

Thus G;, is an axial vector. 

Exercise XA2.4 
By hermitian conjugation, find the commutators not written explicitly above, 
and show the result is essentially the same as switching dotted and undotted 
indices (and similarly for bars), except that G,, Y, and W% (and W°) get 
extra minus signs. This illustrates CP invariance, and the fact that G, is an 
axial vector, while Y is a pseudoscalar (and similarly for W°). 

Exercise XA2.5 
Derive the Bianchi identities in the absence of constraints, in terms of the 


torsions and curvatures (as follow from the Jacobi identity): 
Vial poy? — Tap)’ Trcy” = Russo)” 
VaRgoy' — Tap) Rac)’ = 0 


Exercise XA2.6 


Show that in 4-component notation we can write 
Tap? = —i7$¢; Tap? =YapsG?,  G*=-G, VeG8=we 


This gives another way to see the result of exercise XA2.2. Show that this 
expression for G°? = (G*, B, B) gives it an interpretation as an SO(3,3) 6- 


vector in SL(4) notation (see subsection IC5). 


3. Compensators 


Just as in ordinary gravity (see subsection IXA7), compensators for scale trans- 
formations can be introduced, but for supergravity the compensator should be a 
supersymmetric multiplet. The simplest choice is the chiral scalar superfield @ con- 
sidered earlier: Its complex “physical” scalar (scalar +i pseudoscalar) compensates 


local scale (the real part) and U(1) (the imaginary part), its spinor compensates local 
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S-supersymmetry, and its auxiliary complex scalar appears as one of the auxiliary 


fields of supergravity. 


Compensators are much more important in supergravity than in ordinary gravity: 
Almost any flat space action can be coupled to gravity by the minimal coupling 
prescription — replacing derivatives with covariant ones, and throwing a factor of 
e! in for the measure. In supergravity this is not the case: As we’ll see in section 
XB, we have both integrals over all superspace, which use E~', but also integrals 
over chiral superspace (for integrating chiral superfields), which instead use & for the 


measure. The minimal coupling procedure is then: 

(1) Use ® (and &) to make a flat-superspace action superconformally invariant, 
(2) replace the flat derivatives d4 with the curved ones V4, and 

(3) throw in the measure factors appropriate for the integrals. 


(The last two steps couple conformal supergravity to a globally conformally invariant 


theory.) 


Another compensator that is commonly used is the “tensor multiplet”. (This 
is sometimes confused in the literature with the “(complex) linear multiplet”, an- 
other version of the scalar multiplet with no gauge fields whatsoever.) Treated as a 
matter multiplet, it has the same physical content as the scalar multiplet, but the 


pseudoscalar is replaced with a second-rank antisymmetric tensor gauge field, 


To make things simpler, let’s look at flat space. We first note that this tensor is 
“dual” to a pseudoscalar in the sense of switching field equations and constraints of 
the field strength (see exercises [B2.1 and VIIIA7.2): For the free fields, 


i, =6.0 =|} aly = 0, Gas $€abed’ B™ = OG,=0 
with the field equations following from “self-duality” under F' << G: 
F,= Ga > OF, = OaGy = 0 


Since the theory of B,» must be described in terms of G, alone (because of gauge 
invariance), no renormalizable self-interactions are allowed; thus, this field is of little 
interest in quantum field theory outside of supergravity. In terms of the scalar, the 
fact that only the field strength F,, appears in the field equations means there is the 
global symmetry 


dp=¢ 
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for constant parameter ¢. This generalizes to the nonabelian symmetries of nonlin- 

ear o models, resulting in derivative interactions (again nonrenormalizable) but no 

potentials. 

Exercise XA3.1 

Consider coupling the tensor field to Yang-Mills: To preserve the tensor’s own 
gauge symmetry, this coupling must be nonminimal. To produce such a cou- 
pling, we start with the scalar and duality transform. The coupling we choose 
is another 4D analog to the 2D model we considered in exercise VIIIA7.2, re- 
placing the pseudoscalar and total derivative 5” Fi» with tr(3e Fy, Fea). 
(In general dimensions, the dual to a scalar is a rank-D—2 antisymmetric 


tensor.) We start with the Lagrangian 
L = —460¢ + A@Htr (cP Fas Fea) 


for some coupling constant A. Making use of the Chern-Simons form B,y. of 
subsection IIIC6 to write ¢ in this action only as 0,¢, write a first-order form 


of this action and perform a duality transformation to obtain 
i = ree Fate = 50a Boe] + AB ake 


Find the Yang-Mills gauge transformation of By. (Hint: H is gauge invari- 


ant.) 


The tensor multiplet is described by a chiral spinor gauge field 
6d. =id’d,.K  (K=K) 


Duality is then described in terms of the real scalar superfield strength (in the free 


case) 
F=¢+¢ => @d,F =0, G=i(d.¢%+d6% => €G=0 


with the field equation 
FeH@ 


(F, appears at order 60 in G, and By» at order @ in da.) Again the pseudoscalar has 


a global symmetry: In terms of the superfield, 
oo SIC 


Now we return to curved space, covariantizing the above with respect to confor- 


mal supergravity. We now identify the above global symmetry with the local axial 
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U(1) (R-)symmetry of supergravity. Thus, the superfield G does not compensate for 
this symmetry; it remains as a symmetry in actions that use this compensator. In 
particular, there are no [ d?0 terms in such theories, except those that are locally 
superscale invariant (so the compensator decouples). The matter tensor multiplet 
also differs from the scalar multiplet in that it has no auxiliary fields (except for the 


auxiliary components of the gauge field). 


4. Scale gauges 


Since all the covariant derivatives are built up from the spinor part of the vielbein, 
we define the local superscale transformations for the covariant derivatives by first 
defining 

Ei = LE 


where L is a real, unconstrained superfield. The constraints then imply 
Vi = LV t+2(VPL)Mga t+ 6(Val)Y, Va=LVat 2(V°L)M a, — 6(VaL)Y 
From the anticommutator we find 
iV, =D? (-i) Vag + 4L (VoL) Veg + 4L(VeL)Va 
+ 40-2(VaV LAM yy + BL-2(V VOL") Mga — 8L-*( [Van Val LAW 
Using the commutation relations, we then can show 
B= L(V?+B)L4*, WL=LW,-12i(V°+B)V. in L] 
Goa = (21a Val + Gas) L?, pe = LEW ext 


From the way they appear in the commutators we also have that 
¥G,=0;, Y¥WeeaWa YW oWe YER 


From linearization, we see that B and W® pick out exactly the two irreducible halves 
of the real scalar superfield L: The “vector multiplet” in W/ and the “scalar multiplet” 
in B’. (Compare the vector multiplet field strength and chiral scalar gauge fixing for 
the prepotential V as described in subsections [VC4 and VIB9.) This means we can 
completely fix the superscale gauge by the choice 


B=W,=0 


as the generalization of the scale gauge in ordinary gravity that fixes the Ricci scalar 


to vanish. 
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Exercise XA4.1 


Derive the superscale transformations by use of the Bianchi identities: 


a Use the commutation relations of the covariant derivatives (and the solution 
to the Jacobi identities) to find all the transformations above. Show they 
imply & = Lk. 

b An easier way is to use the reduced Bianchi identities: Determine the trans- 
formations of the reduced field strengths, up to constants, using chirality, 
dimensional analysis, etc., and then solve for the constants by plugging into 


the reduced identities. 


We then define the scale (and U(1)) transformations of the compensators: 
P=L76, Yb=i1b, V,=0 
Care YG =0; (V2 + B)G =0, G=G 


where the scale weights follow from the U(1) weights (vanishing for G by reality) by 


consistency with the constraints they satisfy. 


Exercise XA4.2 
Show that a superfield can be chiral only if it has no dotted indices. Then 


show the relation that any such superfield has between scale and U(1) weights. 


All these transformations can be derived either by consistency with the con- 
straints, or by using the solution of the constraints: In terms of the unconstrained 


superfields that solve the constraints, the superscale transformation is trivial: 
y' = Ih, NaN. pes 


The net result, as for super Yang-Mills, is that all the superficial transformations of 
the constrained covariant derivatives are completely replaced with the new invariances 
that appear upon solving the constraints: K_, and L eliminate 7), K°® kills N,“, and 
K™ reduces Q™ to its real part U“, which transforms only under A”. 

Exercise XA4.3 


Rederive A, as in subsection XA1, but in a general gauge, to find 


— 


iA, = EQT, eT = WP E(1- 7) 


Show this result gives a superscale transformation for A, that agrees with the 
result above. Show the explicit solution for @ in terms of ¢ and T also gives 


it a superscale transformation that agrees with the above. 
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Exercise XA4.4 
Often it is easier to use the solution to the constraints than the Jacobi iden- 
tities: 
a Solve for f'4z in terms of Ag, and use the solution for A, from subsection 
XA1, to derive 
W, = —i(V + B)V.(T +7) 


and use this to rederive the superscale transformation above. (Hint: Define 


and use the chiral representation.) 


b Find an explicit expression for B, and use it to rederive its superscale trans- 
formation. (Hint: You will need to find ,°7 first. Since B is a scalar, you 


can choose the Lorenz gauge Nj" = 64.) 


In subsection XA1 we found that a convenient way to simultaneously fix Lorentz, 
U(1), and scale gauges was to choose E, = E,. (However, the corresponding com- 
ponent invariances reappeared in the chiral gauge invariances.) Compensators allow 
more freedom for gauge fixing: For example, we can fix the gauge B = W, = 0 as 
described above, or we can fix to 1 the compensator or a physical matter multiplet 
(string gauge, as for gravity in subsection IXB5): The possibility of gauges such as 
@ = 1 or G =1 depends on the existence in the action of such fields, and not on the 
details of how they appear (as long as the gauge choice is consistent with the allowed 
vacuum values). In particular, it does not depend on the signs of their kinetic terms, 


which is the only thing that determines what is physical and what is a compensator. 


Note that either = 1 or G = 1 completely fixes the superscale gauge, in spite 
of the constraints on these superfields. (E.g, 6 = & = 1= L=1.) This is due to 
the appearance of the U(1) connection: For example, before fixing the scale and U(1) 


gauges the chirality condition on &, rather than constraining @, actually determines 


the spinor U(1) connection Ag: 
Val = (Ea —igA,)P=0 > A= -3iE, In® 


(But the chirality of the ratio of two chiral superfields with the same weights really 
fixes it to be chiral; in other words, chirality of scalars makes all but one truly 
chiral, since the U(1) connection can be determined only once.) As a result, the 
scale (6@ = 1) and U(1) (@/@ = 1) gauge choice = 1 determines W,: 


@=1 > A=0 > Wy=0 


Similarly, 


G=1 =} 0=(W+B)G=B8 
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Conversely, we see that whenever one of the two field strengths B and W° is elim- 
inated by a superscale(/U(1)) gauge choice in terms of one of the two compensators 
@ and G, the other field strength can be made superscale invariant: If we introduce 
the compensator by a superscale transformation (as for gravity in subsection IXA7), 
substituting either 

L-* = 60 or G 


in the above transformation laws, we find 
B= (66)3/(7" + B)Sb = O?26-3/2(°7" + B)S 
Wa = G/41W, + 34(V" + B)Valn G] 


as locally superscale invariant, where using @ for W, or G for B yields zero. We 
can therefore interpret gauging away the compensators as gauging them into the field 


strengths: We have a choice of either 
@=1 > W.=0, B=B 
G=1 > B=0, Wa=W, 


This is analogous to Stiickelberg gauges (and their nonlinear generalizations): One of 
these two tensors (gauge fields with respect to superscale) “eats” the compensator. 
However, it differs from Sttickelberg in that a second “gauge field” is completely 


gauged away. 


In fact, we'll see in section XB that the pure supergravity actions constructed 


using either of these compensators gives the corresponding field strength as its field 


equation: ; 
— => B=0 
o@ 
5 — 
=> W,=0 
0@y 


Thus, either compensator can be used to eliminate both B and Wg, one as a field 
equation and the other as a gauge choice. This result is already clear at this point 
from dimensional analysis and chirality; similarly, we must have 

re) ss 
sum Ge=0 
where G, is the result of applying a superscale transformation to G, with whichever 
of the two compensators is being used in the action. This leaves W.g, as the on-shell 


field strength. The analogy to ordinary gravity is 


(A, Rav —_ sNavR, Wabca) wo (B/W., Gas Wag) 
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Although the Ricci tensor must appear in G,, superscale invariance allows the 


choice of gauges where the 6 = 0 component is arbitrary: From the above we find the 


linearized transformations 


0Goad ~ AlVa, ValL, 0 Aga y —6[Va, ValL 


(For purposes of evaluating at 9 = 0 we can neglect A,| in 6A,.) Thus, this axial vec- 


tor component field can be moved around as convenient for component expansions. In 


Cus they appear only in their invariant combination, Gg + 2 Aq in this approximation. 


In nonsupersymmetric gauges, we can even gauge G,| = 0. 


Exercise XA4.5 
Use the Bianchi identities instead of explicit superscale to track down the 


axial vector: 


a Use the relation of Wag (which is scale covariant) to W, (the field strength 
for Ay) and G, to show that it is just this combination G,+ 2A, that appears 


in Wey (as its curl, for U(1) invariance). 


b Show that 


Gel > B=0 >» WG. =0 


Thus, in this gauge the axial vector gauge field A, has been gauged out of 
G,| (although its field strength may appear at higher order: the gauge G = 1 
doesn’t fix U(1)). What replaces it? (Hint: What’s in G'’?) 


REFERENCES 


1 


2 


3 


4 


5 


6 


D.Z. Freedman, P. van Nieuwenhuizen, and S. Ferrara, Phys. Rev. D13 (1976) 3214: 
supergravity. 

V.P. Akulov, D.V. Volkov, and V.A. Soroka, JETP Lett. 22 (1975) 187: 

covariant derivatives for supergravity. 

V. Ogievetsky and E. Sokatchev, Nucl. Phys. B124 (1977) 309; 

V.P. Akulov, D.V. Volkov, and V.A. Soroka, Theor. Math. Phys. 31 (1977) 285; 

S. Ferrara and B. Zumino, Nucl. Phys. B134 (1978) 301: 

linearized off-shell supergravity (including auxiliary fields). 

P. Breitenlohner, Phys. Lett. 67B (1977) 49; Nucl. Phys. B124 (1977) 500: 

off-shell supergravity in components. 

Wess and Zumino; Wess; loc. cit. (IVC, ref. 5): 

superspace field equations. 

W. Siegel, Supergravity superfields without a supermetric, Harvard preprint HUTP- 
77/A068 (November 1977); 

The superfield supergravity action, Harvard preprint HUTP-77/A080 (December 1977); 
A polynomial action for a massive, self-interacting chiral superfield coupled to super- 
gravity, Harvard preprint HUTP-77/A077 (December 1977); 


680 


10 


11 


12 


13 


14 


X. SUPERGRAVITY 


A derivation of the supercurrent superfield, Harvard preprint HUTP-77/A089 (Decem- 
ber 1977): 

off-shell supergravity in superspace, compensator superfield. 

M. Kaku, P.K. Townsend, and P. van Nieuwenhuizen, Phys. Rev. D17 (1978) 3179: 
conformal supergravity in components. 

M.F. Sohnius and P.C. West, Phys. Lett. 105B (1981) 353: 

R-symmetry generator in superspace covariant derivative. 

V.I. Ogievetsky and IV. Polubarinov, Sov. J. Nucl. Phys. 4 (1967) 156; 

K. Hayashi, Phys. Lett. 44B (1973) 497; 

M. Kalb and P. Ramond, Phys. Rev. D9 (1974) 2273; 

E. Cremmer and J. Scherk, Nucl. Phys. B72 (1974) 117: 

antisymmetric tensor gauge field. 

J. Wess, Acta Phys. Austriaca 41 (1975) 409: 

N=2 supersymmetric tensor multiplet. 

W. Siegel, Phys. Lett. 85B (1979) 333: 

N=1 supersymmetric tensor multiplet. 

H. Nicolai and P.K. Townsend, Phys. Lett. 98B (1981) 257; 

E. Bergshoeff, M. de Roo, B. de Wit, and P. van Nieuwenhuizen, Nucl. Phys. B195 
(1982) 97; 

G.F. Chapline and N.S. Manton, Phys. Lett. 120B (1983) 105: 

Yang-Mills appearing as Chern-Simons contribution to tensor field strength. 

P.S. Howe and R.W. Tucker, Phys. Lett. 80B (1978) 138: 

superscale transformations. 

Gates, Grisaru, Roéek, and Siegel, loc. cit.: 

complete superspace treatment of N=1 supergravity. 


B. ACTIONS 681 


Now that we understand the structure of superfields in curved superspace, we an- 
alyze various supergravity theories through their actions. We will use several methods 
for finding and evaluating supergravity actions. These actions are significantly more 
complicated than those we have encountered previously, and it is difficult to see all 
their features simultaneously, so for any particular application we use the method 


which best simplifies the property we most need: 


(1) Superspace methods are the best for finding general actions and their symmetry 
properties, manifesting supersymmetry, using globally supersymmetric gauges, 


and performing quantum calculations. 


(2) Component methods are useful for comparing actions and other properties to 
nonsupersymmetric theories. Such approaches sometimes make some use of su- 


perspace, but not superspace integration. 


(3) Compensators are useful in conjunction with either of these methods, and can 
extract many important features and terms in the action with little more than 
the results of global supersymmetry. They reveal useful broken symmetries, and 
are the simplest way to analyze the “superhiggs effect” (Higgs for local supersym- 


metry). 


1. Integration 


The action for supergravity follows from dimensional analysis: Since the usual 
Einstein-Hilbert Lagrangian has dimension +2, as does [ d*@, the superspace La- 
grangian must be dimensionless. The only covariant possibility in terms of the po- 
tentials (E,™ and 2,/) is thus 


Ssq = 3 f aa d‘6 EO 


including a normalization factor that will prove convenient later. Introducing the 
compensator and local scale invariance must also make the usual action for super- 


gravity look like the kinetic term for the compensator multiplet, i.e., 
Ssce=3 i dx d*0 E'b® 


(For simplicity we will restrict ourselves for the most part to the simplest compensator, 
the chiral scalar.) The previous form then corresponds to the scale gauge 6 = 1; 


often the scale + U(1) gauge w = 1 is more convenient. 
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There should also be a supersymmetrization of the cosmological term. This might 
seem difficult, requiring explicit prepotentials. However, we know from our study 
of de Sitter space that the cosmological term is basically a statement about the 
conformal compensator. Therefore, the cosmological term for supergravity, in terms 
of the superconformal compensator ®, should be the supersymmetrization of the 
corresponding term in ordinary gravity, a dimensionless self-interaction for a scalar. 


The solution of the chirality condition can be written as 
YP=-e => P= Seek 
in the gauge w = 1. The result for the cosmological term is then 
Sscosmo = / dz d’°6 E18 + hic. = / dx a6 6° + h.c. 


independent of scale or U(1) gauge: As we saw in subsection XA1, this expression is 
invariant under A transformations, and the integrand itself is invariant under K and 
L transformations. 
Exercise XB1.1 
Although this final result for the cosmological term in terms of ¢ is locally 
superscale invariant, the derivation started in the gauge yw = 1. Generalize 


the derivation, and the result in terms of ®, to arbitrary gauges (see exercise 
XA4.3). 


A chiral expression for Ssqg can be found by similar methods: 
sso = 3 faa cep "B= 3 [ae cops 


Thus, as for super Yang-Mills, the action can be expressed as a real, chiral, or an- 


tichiral integral. With the compensator, 
Ssq = 3 fae 6 B6(7 + B)b = 3 fae do E'0(V? + B)® 
or, more generally, 
pu cof LS ux d°0 oe a +B)L= fu a6 E1(V? + B)L 


which is just the naive covariantization of the flat-space result (at least in the gauge 
w = 1: see exercise XB1.1). Clearly this method generalizes to coupling to other 
multiplets, and allows both [ d?@ and f[ d*0 integrals to be generalized to curved su- 
perspace. In fact, the analysis of the compensator is much simpler than that of the 


conformal supergravity that couples to it to produce ordinary supergravity: Just as 
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in ordinary gravity, where use of just the compensator allowed us to study certain in- 
teresting solutions in gravity, namely de Sitter space and cosomology, some properties 


of supergravity can be analyzed in terms of just the compensator. 


A simple expression (as simple as the super Yang-Mills case) can be written for 
the supergravity action in terms of unconstrained superfields. We first give a first- 
order action, analogous to the one for Yang-Mills (subsection IVC5): In that case the 
action was in terms of V and A,; here it is in terms of U™ and E,,%. Using just the 


constraints solved in subsection XA1, we write the action as 
Ssai = 3 fas d‘@ E'éG(1 — 1T, 4°) 


in terms of the torsion T’, Ae (Note the similarity to the Yang-Mills case, replacing the 
Chern-Simons form with the same component of the torsion.) We already evaluated 
everything except this torsion, which is also easily found in the gauge w = 1, Na" = 04, 
04 = OF =0: 


A 


Sse. = 3 faa d*0 det(E,*)@P(1 — +E," En") 


@ = (det(En®)|-¥3(1-e®)/e%, — {By, Ex} = —iBas™On 


In the chiral representation this simplifies to 
Ssoa = 3 / dx d'O [det(Em*)]/3(1 - e~¥) 3 o(e-" 6) (1 — LE 105. 24) 


Exercise XB1.2 
Find the algebraic field equation for E,,,°. Use this to eliminate it from the 
action, yielding expressions for E,,“, E, and the (second-order) action in 


terms of U™ only. 


The expansion of the superspace action in terms of unconstrained superfields is 
needed for supergraphs, the most efficient way to do quantum calculations. We will 
not consider quantization here; the methods are similar to those described in subsec- 
tions VIB5, 9-10, and C5 for super Yang-Mills. In particular, one uses background 
field methods: For example, as for super Yang-Mills, 


ce? > e@BeQ se OB me Ege", ef = e@ae%a 


The end result is that the expansion is about background-covariant derivatives D4, 


e.£., 
Va=e Dye, Va =e (WDa + 27 Mpe® 
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etc. However, D4 satisfy the same constraints as the full covariant derivatives: For 
example, they have nonvanishing torsion LAB = — 18455, This differs from the ex- 
pansion implied above in U™ about partial derivatives, which anticommute without 
torsion: For a perturbation expansion useful for quantum calculations, one must ex- 
pand in h,» about (e,”"), rather than in e,” itself; thus (at least) the vacuum value 
(U) must be separated from U. 


2. Ectoplasm 


Although all supersymmetric theories can be analyzed directly in superspace (in- 
cluding classical solutions, effective potentials, Feynman graphs, etc.), for comparison 
to nonsupersymmetric theories it is necessary to expand superfields in components. 
Since all fundamental theories are described by actions, it is sufficient to give a pre- 
scription for evaluating any action in terms of component fields, as in subsection [VC2 
for global supersymmetry. In locally supersymmetric theories, the vielbein needs to 
be expanded in terms of the prepotentials for supergraphs. We can also find compo- 
nent actions by a straightforward Taylor expansion in @ of the prepotentials in the 
superspace action. However, for component expansions of classical actions, one can 
get by more simply by applying Bianchi identities to the covariant derivatives and 
differential forms (antisymmetric tensors). It is unnecessary to know even the explicit 


form of the measure in terms of the vielbein or prepotentials. 


The fundamental idea is to think of the Lagrangian not as a scalar times a mea- 
sure, but more “geometrically” as an antisymmetric tensor. Although this approach 
does not work for the usual superspace Lagrangians because of the peculiarities of 
fermionic integration, it can be applied to component Lagrangians integrated over 
4D spacetime, treated as the bosonic subspace of superspace. We thus write the 


component action as 


S 


a / da dae ae aa? Lipa tO) 


where Lizwpg is a graded antisymmetric superfield. Of course, the action should be 
independent of 6, even though we have integrated over only x. This is equivalent to 
requiring that the integral should be independent of the choice of 4D hypersurface in 
superspace. We are familiar with a similar requirement for conserved charges, which 
are defined as integrals over 3D hypersurfaces: Treating the conserved current in 


terms of the 3-form dual to the vector, 


imma? = = 4 f dadz"de” Ip 


B. ACTIONS 685 


the dual to the usual conservation law as vanishing (covariant) divergence of the 


vector is vanishing curl of the 3-form: 
d 
He =0 => Ohne = 0 


An important point is that neither the definition of the charge nor the conservation 
law requires a metric, since integration in general does not. We thus require for our 


supersymmetric action 


O 

36° =0 => Omlnrer =9, OLunreqg= OMAN PQ) 
where the gauge invariance allows us to drop terms in the Lagrangian that are total 
derivatives (surface terms). Note that both Z and X are assumed to be local func- 
tions of the fields and their (finite-order) derivatives. (As a result, this is not the 
usual “cohomology”, where both would be allowed to be arbitrary functions of the 


coordinates. ) 


Converting the curl-free condition to flat indices (see subsection IVC5 for the 


Chern-Simons superform), 


aV aLecpe) — sql \4B\ Lricpor) = 0, 6Lascp = 4V[AABCD) — ane 4B|" Ax\cD) 


The plan is then to find Lagcop, in terms of which the action can be written as 
S= / dz (—p) er Ey? Ey? En? E mL agcp 
where E,,“ is exactly the nontrivial part of the inverse vielbein Ey,4: 
Em” = (€m*, Ym") 


namely the inverse vierbein and the gravitino. (If we also write Wm° = €m?Wa%, we 


can collect all e,,¢ factors into a factor of e~! using the ¢€ tensor.) 


The next step is to explicitly solve the curl-free condition on the 4-form in terms 
of the usual scalar superspace Lagrangian. (An alternative is to solve the Bianchis for 
the field strength of the 3-form gauge field, which is also a 4-form.) The procedure 
is the same as that used to solve the Bianchi identities for covariant derivatives (in 
subsection XA2): We start with the lowest-dimension equations and work up. The 
equations that include the constant (vacuum/flat-space) part of the torsion can be 
solved algebraically, the rest give differential constraints. Of course, we will need to 


use the results of subsection XA2 for the torsions and their constraints. The result is 


Tied — Ea” waeal, crabed = t€nd,ocaW L, Loobed _ eiseal Ve =P 3B)L a h.c.] 
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and their complex conjugates, the rest vanishing, where CL is the usual chiral super- 


space Lagrangian (superpotential): 
Vel =0 


(We use the shorthand notation a = (ad), etc.) 


The result is thus the component expansion of the usual curved superspace action 
= pu POE "L+he. 


Using the Bianchi identities of the covariant derivatives one can also covariantize the 


usual solution to the chirality condition: 
L=(V +B)L 
which allows us to identify the action as 
s= fe d*@ E(L+L) 


so L can be taken real without loss of generality for general d‘ integrals. On the 


other hand, for supergravity we can take 
Log = 3, Leg = 0 => Len = sb, Lsq = 0 


or vice versa, and the curvature appears in terms of R,g®® and not Ry? (like the 
corresponding fag without Sag for Yang-Mills), with half as many terms to collect 
(for the same final result). In general, we thus have an expression for S in terms of 
E,,4 and the components of L4gcp, and for the latter in terms of curvatures and 
covariant derivatives of L, which can be evaluated by the same methods as in flat 
space (except that the commutation relations of the covariant derivatives are more 


complicated). 


Components of a superfield are again defined by evaluating its covariant deriva- 
tives at 6 = 0. However, as in the case of global supersymmetry, the value of @ is 
arbitrary, since the result for the action is 6 independent: We therefore will gener- 
ally drop the “|” in component expansions of actions; any superfield then implicitly 
refers to the corresponding component. This 6 independence also means that it is 
not necessary to make any gauge choices: These methods automatically express the 
action in terms of just the component fields that cannot be completely gauged away. 


(For example, F,""| never appears. ) 


Exercise XB2.1 


Collect all the above results: 
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a Find the complete component expression for the most general chiral ([ d?@) 
action in terms of covariant derivatives of the superpotential, and the super- 


gravity fields. 
b Do the same for a real ([ d*@) action. 
Exercise XB2.2 


Evaluate the above actions for massive ¢° theory (for a chiral matter field ¢) 


in terms of the components of ¢. 


Exercise XB2.3 
Do the same for super Yang-Mills: 


a Solve the Bianchi identities for super Yang-Mills in curved superspace. 


b Use this result to evaluate the component expansion of its action. 


3. Component transformations 


We saw in the previous subsection that in component expansions the supergravity 
gauge fields naturally appear as E,,4, since by definition we restrict to the bosonic 
submanifold. Similar remarks apply to the component form of their supercoordinate 
transformations (i.e., local supersymmetry), and the related component expansion 
of their field strengths: In subsection IXB4, we saw that coordinate transformations 
(as applied to solving the radial gauge condition) were simpler for Ey,“ because the 
derivative term on the parameter was just 0,4, kK“. 

We therefore begin by rewriting the gauge (coordinate) transformations in terms 
of E,,“. As for gravity (see subsections IXA2 and IXB4), we can choose to make the 
transformation laws more manifestly covariant by writing the generators in terms of 


covariant derivatives. Then, as for gravity, 
K=K4V,+K'M, 6Va=[K,Va] => 


jE = Vuk =f Kos te ea, SO = =Vupk tea hae 
6(T4p°, Rap’) = K(T4p°, Rap’) 


using (6E4™”)Ey® = —E,MbEy®. Here Vy = Ev4Va = Om + Qu'M1, so these 
transformations on Ey,“ and Q m4 contain only bosonic derivatives Om, other than 
those implicit in the torsions and curvatures. (Similar remarks apply to only Ey,“ 
and 2,‘ appearing on the right.) Since the component expansion is an expansion in 


?, it is really only supersymmetry K°® = e* which is no longer manifest; specializing 
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to that, the transformations on the physical gauge fields become, using the results of 


subsection XA2 for the torsions, 
Sem = iD + Em), — Sthm® = Vne® + tem”? (epGg — &53.B) 


We have not yet determined the solution for the Lorentz connection w,,“ and the 
transformation laws for the auxiliary fields. We will also need the relation between 
the usual curvature and the components of the superfield strengths. All of these can 
be found by use of the identities (see subsection IXA2): 


— En? Em? Te0* _ Tin =] Oma a Big ane 
BPR Rag” = Raa -_ Ouag 4 De ae 


The application of these identities is similar to that for expansion of the action in 
the previous section: The separation of the factors of E,,4 into bosonic and fermionic 
parts yields an expansion in powers of the gravitino field. For the torsion case where 
the index A = a we solve for Wma) in terms of the torsions (auxiliary fields and 
constants) and vielbein; for the torsion case A = a we solve for 7,7, used for the 
transformation law of the auxiliary fields, and for the curvature case we solve for 
R,»%, used in the component expansion of the action, in terms of these auxiliaries, 
the vielbein, and the just-determined connection. The U(1) connection A,, needs no 
solution: It is pure (superscale) gauge, and will cancel in actions (after perhaps an 
appropriate redefinition of G,). For these manipulations we use the relations that 
Tap© and Rap™ have to B, Ga, Wa, W.y, and their derivatives (as expressed by the 


solution to the Bianchi identities given in subsection XA2). The solution is 


A A 


Wmbc = Ca Wate = a lice _ Totea) | Tae = Cer Lag = ean "Ga 17 ida Py) 
Tp’ = 64 65 Via | +e i(WapG” a Yp00B = b) = Capt ss + Castes’ 


6B = —26%tag”, 0Gad = —€? (teoa + 3Coats 5”) + h.c. 


Ag? = ee Ra + Bug ,°) 
— ila (Cat g + 35/05 tye) — Bar gta”? — a 


«On 


where refers to the usual expression for pure gravity, W_? = eq’"Wm", and we 


have chosen the superscale gauge W° = 0 for simplicity. 
Exercise XB3.1 
Find the extra terms for W°% ¥ 0. 
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4. Component approach 


We can now take the superspace action of subsection XB1, as expanded in com- 
ponents by the ectoplasm method of subsection XB2, and substitute the component 
expansions of the field strengths found in subsection XB3, to find the component 
action 

Lsg=Lg+Lyte'l, 


Lg =—leOR, Ly =e™™mah{en®®, Vp}ga, En = —3(Ga)? + 3BB 


(Note the signs are again consistent with G, and B, B forming a 6-vector of SO(3,3), 
though not in the same way as in exercise XA2.6.) Here V and Lg are the usual 
covariant derivative and Einstein-Hilbert action of general relativity in terms of e 
and w, but w is slightly different from any of the connections used previously (see 
exercise XB4.1 below). It also differs from the w given above in that we have explicitly 
extracted the G, piece (which is the sole source of w in the ectoplasm approach). An 
alternative to ectoplasm to determine w is to use a first-order formalism: Rather 
than imposing the usual torsion constraint, we can leave the Lorentz connection as 
an independent field in R and in the V in Ly. Eliminating the Lorentz connection 
by its field equation yields a modified torsion constraint, and produces 7* terms in 
the action. We have written Ly in a form manifestly symmetric with respect to 


integration by parts. (Alternatively, we can write eV — WeVy.) 


As an alternative to deriving the component action from the simpler superspace 
expression, we can postulate the component action directly. In the component ap- 
proach writing the action in components is more direct than the superspace approach 
by definition, but proving supersymmetry invariance is less so. This is not so true 
when coupling to matter, where writing component actions can also be as compli- 
cated as deriving them from superspace, so here we consider the simplest case, pure 
supergravity. We thus begin by postulating Lyg = Lg + Ly; the first term is obvious, 
while the second follows from minimal coupling for the free gravitino action, which can 
be derived easily by many methods (see, e.g., subsection XIIA5 below). We ignore 
the auxiliary fields, which are necessary for off-shell closure of the supersymmetry 


algebra, but not for supersymmetry invariance of the action. 


We write the action for gravity in a form that more resembles the gravitino action 
(see exercise IXA5.5): 


1 -mnpq mle en? engad 


Lg = ye 'R= te 


a, b cd _ 1 
EabedEm En Tie — 8 


= Efe en (Rapa gag — Rup! €ge8) 


oo 
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where we have switched to spinor notation for the curvature (see subsection IXA1) and 
used duality in both vector and spinor notation (see subsection IIA7). (Alternatively, 
we can regard this as the definition of the gravity action.) We then have for the 


variation of this part of the action (after some integration by parts) 
6Lg = ie eT Respat O6s ce — Rnpa’ 5€qpal — (Gaal 1 ase _ (SWra”)Trapal } 
where we have used (see exercise IIIC1.2) 


bin = Vinee 5 il di VaV™ = / dt 0,V"=0 


Next, we pick the obvious transformation law for the gravitino field as the gauge 
field of supersymmetry: 
Wa = Vane’ 
The transformation laws for e and w will be derived as a by-product of the invariance 
proof, as will the explicit expression for w in terms of e and w. Substituting this 


expression for dw into Ly, 


6 Ly = CP ea Vind len, Volta — fa Vm t{en™, Vo}Wen 
7 50 ma (Sen) V poe _ 5Wma(€n*) Vinee 
+ MWmaen®? (Supa?) bag — HY maen®* (Bape?) 
where we have integrated by parts to free the supersymmetry parameters of deriva- 


tives. We then use the antisymmetrization on all curved indices to collect the resulting 


terms into torsion and curvature as 
Vie, V} = {e, VV} + (Ve)V = s{e, R} — 5TV 


The curvature terms then cancel those from Lg, if we choose for de in 6Lg the 
transformation law 
bea _ ae a Ein”) 
Then we also substitute this expression for de in dL,, and note that half those terms 


immediately drop out, since 


Vim Vnja = 0 
by antisymmetry. The remaining terms from both Lg and Ly then can be collected 
as 
dLsq = DTT POA 


je = Lae ad itm Unj° 
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Angad = i(Sipa”)e15, — i(Swpa”)eq\sa + €aV pBaja — ExV Wala 
q\B 


We now note that the former factor in 6Lgq vanishes by virtue of the equation of 


motion from varying the connection: Rather than vanishing, the torsion now satisfies 
second — order : Tone = 0 


We can regard T as the “supersymmetrized torsion”; this is equivalent on shell to the 
result we found in the previous subsection from superspace. We can therefore quit 
now, since in a second-order formalism the torsion (and thus the Lorentz connection) 
would satisfy this equation even off shell. (This approach, using the second-order 
formalism but not bothering to substitute the supersymmetry variation of the con- 
nection, is called the “1.5-order formalism”.) On the other hand, we can just as easily 
recognize that in the first-order formalism cancellation of 6Lsq is also guaranteed by 
allowing vanishing of the latter factor to define the supersymmetry variation of the 


(independent) connection: 


first—order: A = 


pga. 


Thus, use of the first-order formalism requires no more work than 1.5-order (contrary 
to remarks in the literature), which is really the same as second-order, and provides 
the bonus of yielding the transformation law for w. However, it is useful to note that 
not all quantities should have their longer forms substituted at the beginning of a 
calculation (just as we learned in high-school algebra not to plug in numbers till the 
end). 
Exercise XB4.1 
Let’s complete this calculation to the bitter end, finding all the properties of 
the connection: 


a Solve the torsion constraint for w (see subsection IXA3). 


b Find the transformation law for w that follows from cancellation of the above 


terms off shell (i.e., without imposing the torsion constraint). 


c Show the above two results are consistent (modulo terms with w field equa- 
tions, which can be canceled by contributions from auxiliary fields) by plug- 
ging the expressions for de and dw into the variation of the result for part a 


and comparing with the result for part b. 


d Compare these results with the connection found in the previous subsection. 


How does the appearance of G, affect the transformation law? 
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5. Duality 


Although antisymmetric tensor gauge fields can be avoided in general, they tend 
to turn up in string theory, so we now look at them a little more generally, examining 
their actions and how they relate to those for scalars. In particular, we note that 
a sensible action for such a tensor alone cannot be constructed that is conformally 
invariant: From the same analysis as for electromagnetism or Yang-Mills (subsection 
IXA7), we see that (F,)? does not give a scale-invariant action in four dimensions. 
Thus, such a field is not suitable as a compensator for pure gravity. However, in 
supergravity the tensor multiplet (see subsection XA3) also has an ordinary scalar, 
and an appropriate power of it can make the tensor’s action conformal. Therefore, 
we now examine general duality transformations for the supersymmetric case, which 
is more relevant for understanding its use in gravity. We will consider explicitly a 
flat superspace background for simplicity, but generalization to curved superspace 
by covariantization is straightforward, replacing flat superspace derivatives with (su- 
perconformal) covariant derivatives, introducing supergravity field strengths where 
necessary (in this case, just d? > vy + B for chirality), and using the covariant 


integration measures. 


Duality transformations can be performed directly in the action by use of first- 


order formulations. Starting with the general tensor multiplet action 
See pu d*@ K(G) 
where K is some function and G = d*@,, + h.c., we write this in first-order form as 


So / dx d*6 [K(V)+ VG] 


where V is an unconstrained real superfield and K is the Legendre transform of K: 
For this action to reduce to the previous upon applying the algebraic field equation 
of V, we must have 


IK(V) + VGllozm__g = K(G) 


G 
The duality transformation is then performed by varying ¢, instead of V in S¢: 


Remembering that ¢, is chiral, so 
6 ; d‘z d*0 VG =3 / d*x d?0 (6¢%)d?daV + h.c. 
we solve the condition on V as 


@dV=0 => V=¢+¢ 
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since thinking of V as the prepotential for a vector multiplet says that it is pure 


gauge. The dualized action is then 
So = ux d‘6 K(¢+¢) 


We can also reverse the procedure through another first-order action 


si, = fu d*6 [K(V) —V(o+)] 
where in this case varying with respect to ¢@ implies 
V=0 => V=C 


while varying with respect to V gives the inverse Legendre transform 


[K(V) -Vd+ @)]loxn_4.3=K(b+ 4%) 
OV 
The simplest case is the Lagrangian 3G": We then find 
K(V)=4V?, & =K(V)=-4V’ 


so the duality is 
Lim =3@ @ Lg =—35(¢+ 4)” 


In flat space, this gives the usual free result L, = —¢d, but in curved space the 
—4¢’+h.c. part does not vanish because E~! is not chiral. Consequently, this action 
is not the conformal one (as we already knew from the component argument above). 
However, the conformal one is easy to find by starting with —¢¢: Making the field 


redefinition ¢ — e® expresses the action in terms of ¢ + ¢. Legendre transforming, 
K(V)=-eY & K(V)=V(inV—1) 


so the duality is 


Lg= et? & Lim=G(InG-1) 
These two conformal actions for matter, when coupled to conformal supergravity, 
become the two “minimal” actions for supergravity, when the overall sign is changed 
to make the matter fields into compensators: The version with @ as the compensator 
is called “old minimal”, while that with @, is called “new minimal”. They differ only 
off-shell, in their choice of auxiliary fields. Note that the field equations for the two 


conformal multiplets, 


P’b=0 (P=e?%), @din G=0 


694 X. SUPERGRAVITY 


reproduce the compensator part of the supergravity field equations B = 0 and Wo =) 
described in subsection XA4. Again, the full expressions follow from the usual su- 
pergravitational and superscale invariances, which were used to find B and Was the 
compensator dependence is enough to identify them as the appropriate covariantiza- 


tions. 


Exercise XB5.1 
We saw in subsection IVC5 for the Chern-Simons form, or XB2 for ectoplasmic 
integrals, that differential forms can be defined in superspace. Do the same 


for the tensor multiplet: 


a By generalizing the bosonic case to superspace with curved indices, and then 
“flattening” the indices (as for the Chern-Simons superform), show that the 
super 2-form Byg with field strength Hagc and gauge parameter A, is de- 
scribed by 


6Bap = VisApy — Tas©Xc, Hasc = $V ABscy — $148)" Boo) 


(Hint: Show that replacing E4 — V4 and Cyg? — T,4p° yields only cancel- 


ing connection terms.) 


b Show that the torsions given in subsection XA2 satisfy 
: deg FH st 


Note that this also implies the gauge invariance of the Chern-Simons form of 


the super Yang-Mills action in curved superspace. 


c Show that the constraints 


Hopy = Hopy = Hape=0, Hyg = —iCayCy,G 
(and complex conjugates) can be solved by 
Bag = Bap a 0, By 95 = —1C'5 3083 Vade = 0 


Bas = Cygbaa + Caabay, ba = 3V(adai = G = 2(Vad" + Vad") 
Relate the results for the gauge fields Bap to those for the Yang-Mills field 
strengths F'4p (subsection IVC3). 


d Supersymmetrize the construction of exercise XA3.1: Show one can define a 
field strength 


Haso = Haso + Base 
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using the Chern-Simons superform Bygzo. 


The supergravity component action with the tensor multiplet compensator differs 
from the one of the previous subsection in that B and the longitudinal part of G have 


been replaced by the gauge field B,,y: 
Lg — de™"4Cn On Boy 


which is the only possibility that preserves the gauge invariances of both G and B 
while leaving them both auxiliary. (Their field equations are that their field strengths 


vanish. ) 


6. Superhiggs 


Supergravity affects spontaneous supersymmetry breaking in a simple way: From 
the discussion of the immediately preceding subsections, we know that supergravity 
can be described more simply as conformal supergravity coupled to a compensator. 
Simple (N=1) conformal supergravity contains no scalars: It consists of only confor- 
mal gravity (the traceless part of the metric), the conformal (traceless) part of the 
gravitino field, and an auxiliary gauge vector. Since symmetry breaking involves giv- 
ing vacuum values to only scalars, we can replace supergravity by just its compensator 


for these purposes. 


For a general analysis, consider a kinetic term 
oS / dx d*0 36ge~KO-X0/8 


(The exponential form will prove convenient for later component analysis.) This is 
the most general kinetic term with the usual number of spacetime derivatives: Any 
term of the form f(¢,¢, x’, X:) can be rewritten in this form after appropriate field 
redefinitions. In particular, if we start with fields with arbitrary Weyl scale weight, 
then this form follows after rescaling fields so only @¢ carries scale weight, since all 
terms in the Lagrangian must have the same scale weight, fixed by (super)conformal 
invariance. @ is then the only field to carry U(1) weight, which is proportional to 
scale weight by superconformal invariance. Then ¢ appears only as ¢¢, while K is an 
arbitrary function of x‘ and y;. The first step in evaluating this action in components 
is to simply ignore conformal supergravity altogether, and evaluate this action as is, 
in terms of matter and compensator multiplets, by the methods we have considered 
previously for evaluating @ integration. The next step is to add back in some parts 


of conformal supergravity: 
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(1) the conformal graviton, which can be put back in easily and uniquely using coor- 


dinate and local scale invariance; 


(2) the U(1) axial gauge vector, whose coupling is minimal, and thus follows directly 


from U(1) covariantizing the spacetime derivatives; and 


(3) the conformal gravitino, whose quartic couplings can be quite complicated, but as 
a practical matter we are interested in only the mass term, which is determined 
from the mass of the Goldstone fermion it eats, which appears in the compensator 


(and the kinetic term, which is the usual one). 


Before considering the general case, we look at the pure supergravity case under 


this analysis: Looking at just the bosons, we find 
Ssc.o = fe e”'3{-3[(V — ifA)d] - [(V + ig) 4] — $6¢6R + OBB} 


where V is the usual covariant derivative of general relativity, A is the U(1) gauge 
vector, and the relative coefficient of the R term was fixed by local scale invariance 
(see subsection IXA7). Note that here B is the usual auxiliary field from ¢, and is 
not associated with conformal supergravity. Choosing the component U(1) and scale 


gauges ¢| = 1, this reduces to 
Ssa,b fu e '(-4R -_ 4A? + 3BB) 


Relating to G,, we recall that if we had included it from conformal supergravity, 
for this compensator G.=C¢.> 2 Aa, so we can identify A, with 2Ga. Thus, the 
compensator method immediately yields the bosonic action, including auxiliary fields. 

Returning to the general case, the part of the action for the “physical” scalars (| 


and x|) then starts out as 
Stipe = f da eK E310 — i A)A] -[(V + ihA)d] + HIV + HA) O]-(OKIVE 


+9[(V — ig A) 9] - (AK)VX' + $4[(0:07K) — 3(0,K)(0'K)|(VX;) - (Vx') — 56GR} 
ignoring until the following subsection the auxiliary scalars, which are irrelevant for 
the kinetic term. We use the notation 0; = 0/Ox', 0° = 0/O0xX;. We then choose the 
U(1) and scale gauges 
| = eK (x'lxiD/6 
where we have explicitly written the |’s to emphasize that this is a nonsupersymmetric 
gauge choice for the component ¢|. Finally, we eliminate A by its algebraic field 


equation. We thus obtain 


Gies / de e!4[(0,5°K)(Vx%;) - (Vx?) — BR] 
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Except for the R term and covariant derivatives, this is what would follow in flat 
superspace from the action — [ dz d‘@ K. For supersymmetry breaking, we also need 


the super cosmological term 
oe = fu d?0 \o? + hic. 


for some constant . We could consider more general potentials ¢3e/') (again the 
power of @ is fixed by scale and U(1)), but then the field redefinition @ — def/ 
would remove it while replacing K > K + f + f. (This invariance, and the form of 
the “metric” on the space of fields y and Y appearing in the action, identify K as a 
“Kahler potential” .) 


The analysis for Sx can also be made by performing a duality transformation on 
the compensator. Following the same steps as described in the previous subsection for 
the case without matter (factoring the overall —3 out of the process for convenience), 
we find 


See - fav d'0 [3G In G+ GK(x,X)] 


Since in this form A decouples, the result is obvious from the flat-space result. 


Exercise XB6.1 
Repeat the above analysis using the compensator G: Evaluate explicitly all 
the contributions from the bosons in G, couple A, find the R term, show the 


result is the same. 


Normally any kind of symmetry breaking will generate a cosmological term, since 
a scalar getting a vacuum value implies the potential itself getting one, giving a term 
J dz e-*constant. This would require adding a cosmological term to the action by 
hand to cancel the generated one, since the constant generated would correspond to 
a subatomic length scale, whereas a realistic cosmological constant requires a cosmo- 
logical length scale, which means a constant, going as 1/length?, of the order of 10~*° 
in subatomic units. An exception is when the potential is flat in some direction: In 
supersymmetry energy is always positive, and the supersymmetric vacuum has zero 
energy, but some potentials allow other, perhaps nonsupersymmetric, vacuua that 
also have zero energy, and thus generate no cosmological constant. This avoids the 
ad hoc procedure of “fine tuning” the cosmological constant of an added term for 


exact cancellation (or at least to order 10~°°). 
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7. No-scale 


A useful example of the superhiggs effect with a flat potential is “no-scale super- 
gravity”. This theory has an explicit super-cosmological term, but the kinetic term 
is such that this term does not generate a component cosmological term, but does 
spontaneously break supersymmetry. The simplest example describes supergravity 
coupled to a single chiral scalar multiplet. The kinetic term has an SU(1,1) symme- 
try, and also appears in N=4 supergravity (see subsection XC6 below). Written in 


terms of just the compensator part of supergravity, it is 


Se = pu d*0 3(éx + X¢) 


where @ is the compensator and x is the matter. We have written it in a manifestly 
U(1,1) covariant form, where the U(1,1) metric is off-diagonal (($ = instead of the 


usual diagonalized CG a): For the above component analysis we redefine 


picnine Sx f de a8 360(x + x) S KS=3esey 


(Many other superfield redefinitions are possible to put this in more conventional 
forms, such as (36¢— Xx), ¢¢(3 — Xx), etc.) The kinetic term for the physical scalars 
follows from the same analysis we applied to the CP(1) model in subsection IVA2. 
The only differences here are: (1) the symmetry is U(1,1), not U(2), and (2) the 
constraint on the norm of the complex 2-vector follows not from a Lagrange multipler 
(or a low-energy limit), but as a local scale gauge chosen to give the Einstein- Hilbert 
curvature term the usual normalization. Alternatively, we can use the analysis given 
in the previous subsection for the general case to find 
-11 |Vxl? 1 
Ska = fe es pee = a 

However, to study just the supersymmetry breaking, we want to look at the “po- 
tential” terms: terms that involve the auxiliary scalars instead of spacetime deriva- 
tives. We thus now need to include the super cosmological term, which breaks the 
SU(1,1) invariance. Again evaluating at first without conformal supergravity, then 
putting some (all but the conformal gravitino) back in, we find the contributions from 
Sr and S, 


Sour = pu e'3[BB(x + x) + (Béb+ Bob) + (Bd? + B”)| 


where B = d?¢ and b = d?y. We then see that eliminating the auxiliaries gives 
nothing, so there is no potential to generate a cosmological term. However, there is 


still a mass term for the gravitino: As always, S, also contains the spinor term 


6A($C2 + hac.) 
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where ¢, = dg¢@ is the trace of the gravitino. The gravitino in this model therefore 
has a mass proportional to \(y¥ + y)~1/”. 


Exercise XB6.1 
Explicitly evaluate the spinor part of the kinetic term, and thus determine 


the exact value of the mass of the spinor, and thus the gravitino. 


SU(1,1) invariant kinetic terms also appear in superstring theory, but unlike N=4 
and no-scale supergravity, the kinetic term is (¢y + ¥@)'/3 instead of just dy + Xo. 
(See subsection XIA6.) When applying no-scale supergravity to nature, more matter 


multiplets are added, 
Se fu d*0 3(dv + xO — Kix’) + ¢ dx 26 rAp8ef0C/9) + he.) 


generalizing SU(1,1) to SU(n,1) in the first term. (N=5 supergravity has such an 
SU(5,1) symmetry; see below.) Then y acts as the “hidden” matter sector that doesn’t 


directly couple to the observed matter x‘, but serves only to break supersymmetry. 
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sueesiioauaneat C. HIGHER DIMENSIONS ................. 


A convenient method for describing extended supersymmetry in D=4 is to ap- 
ply dimensional reduction to supersymmetry in D>4, since (1) spinors are bigger in 
D>4, so even simple supersymmetry reduces to extended supersymmetry, and (2) the 
Lorentz group is bigger in D>4, so some 4D scalars arise as parts of higher-D vectors, 


etc., meaning fewer Lorentz representations in the multiplet in D>4. 


1. Dirac spinors 


We saw in subsection IC1 that coordinate representations of orthogonal groups 


SO(D) could be defined in terms of self-conjugate fermions, 


Ca = slYas Yl, 155 = Vas 


We now will construct explicit matrix representations of the Dirac matrices for arbi- 


trary D, and examine their properties. This is useful for understanding: 
(1) representations of internal symmetries, such as in Grand Unified Theories; 


(2) theories in higher dimensions, which give simpler formulations of certain four- 
dimensional theories when the extra dimensions are eliminated, and appear in 


string theory; and 


(3) properties of spinors that are independent of D, or their dependence on D, which 


is useful for comparison and for perturbation in quantum field theory. 


An explicit solution can be found easily by first looking at even dimensions, and 
breaking up the problem into D/2=n two-dimensional problems. Furthermore, we can 
look first at the Euclidean case (SO(D)), and solve for the other cases (SO(D,,D_)) 
by Wick rotation. The solution for SO(2) is just two of the Pauli o matrices. The 
general solution then comes from the direct product of the two-dimensional cases, 
using the third o matrix to introduce appropriate “Klein factors” (see exercise [A2.3) 
to insure that the y matrices from one two-dimensional subspace anticommute with 
those from another. The resulting y matrices are then: 

1 1 


v2 v2 


where i = 1,2, there are a total of n factors, and the number of 203 and J factors 


(V203 @ --- ® V203 ® V20; @1@---@ 1), (V203 @ --- ® V203) 


in the first expression ranges from 0 to n—1. The last matrix can always be included 


to extend SO(2n) to SO(2n+1); in fact, up to normalization, it’s simply the product 
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of all the other y’s. (In other words, the product of all the y matrices is proportional 
to the identity.) 


Exercise XC1.1 
Apply exercise IC1.2 to this construction: Show how this representation re- 
lates simply to creation and annihilation operators. Show that these Klein 


factors are identical to those of exercise IA2.3. 


The next step is to notice that this construction generally gives a reducible rep- 
resentation. Reducibility comes from two properties: (1) For SO(2n) we really have 
SO(2n+1); and (2) the representation may be real. In fact, most of the interest- 
ing cases involve SO(2n) (in particular, SO(3,1) for Lorentz and SO(4,2) for confor- 
mal in four dimensions). In that case we can call the first (or any other) y matrix 
(0, ®@1@---@T) for SO(2n+1) “y_1”, and take the rest as those for SO(2n). Then 


the projection operators 


Hee*layay4) = Taig, Ali, S06, 1. f= 1 


commute with the SO(2n) generators Ga, ~ yy, so they can be used to project the 
representation of the y’s into two representations of SO(2n). These two halves of 
a Dirac spinor are known as “Weyl spinors”. A convenient representation of the y 
matrices for this purpose is the one given in subsection IIA6, with the representation 
of the Pauli matrices used in our SU(2)/SL(2,C) discussion of subsections ITA1 and 
9, 
=H) m= ACs =H) 

We then can write the spinor, which has 2” components (since it represents the direct 


product of n representations of o matrices, each of which has two components) as 


two 2”-!-component spinors projected by 


Hy, =04 818... @f, Oey C= (57) 


The y matrices then take the block-diagonal form 


Y1= yalor), other y= (25) 


We will refer to these reduced matrices o (and o), and the y matrices themselves for 


SO(2n+1), as generalized Pauli (0) matrices. 


The reality properties of the representation depend on the existence of a metric 
n ia (or 2 ae for psuedoreality, which doesn’t reduce the representation), as in our 


discussion of classical groups of subsection IB5. In fact, all the spinor representations 
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of any orthogonal group are also defining representations of another group: For less 
than seven dimensions, this leads to the identification of covering groups discussed in 
subsection IC5; for more than six dimensions, it only identifies the orthogonal group 
as a subgroup of this new group. (An interesting exception is SO(8), where the spinor 
representations are also 8-dimensional, and are the two other defining representations 
of SO(8).) In matrix notation, we look for a matrix C = 7 or (2 such that we can 


define the operation of charge conjugation as 
yoC lw, Gu =clGy)* = G=Cc 'G*C 


If we like, we can also choose 
C=C'\=c 
without loss of generality. For a representation to be invariant under charge conju- 


gation (i.e., real) 
C=C wr = CrSaC 


For our y matrix representation, the matrix to look at is 
C=... & CoV 203 & Co & Con 26 &) Co 


where 

C= (50) 
independent of the representation used for the Pauli matrices. (Our representation is 
simplest, since then Cy/2o3 = I, and C' = C’. In other representations, C’ may also 
need an n-dependent: factor of i if we want C = Ct.) Using properties of 7 matrices 
we found in our discussion of SO(3) in subsection IIA2, such as Coa0*C2 = —o, we 
find 


Co1y*C _ (-1)"9, Ct = Ca emer cr = (anor, ct = Cu} 


This distinguishes 8 cases, where the irreducible spinors are: 


SO(8m): Wey] and real 
SO(8m-+1) real 
SO(8m-+2) Weyl 
SO(8m+3): pseudoreal 
SO(8m+4): Weyl! and pseudoreal 
SO(8m-+5) pseudoreal 
SO(8m-+6) Weyl 

( ) 


real 


704 X. SUPERGRAVITY 


For SO(4m+2), charge conjugation does not preserve 7_;, and thus //,. Therefore, 


in those cases there is no metric 7 Te or 2 i on the irreducible spinor: The Dirac 
spinor consists of two irreducible spinors that are complex conjugate representations 
of each other. In general, a Dirac spinor has 2” complex components for SO(2n) and 
SO(2n+1); the Weyl condition reduces this a factor of two for SO(2n), as does reality 
where applicable. (Pseudoreality does nothing.) 


It is useful to know the other group metrics, if they exist. For unitarity properties 


we look for a metric 14% such that 
G=-T°GY = ray 
(Thus g = e© satisfies Yg-! = g'V.) We can also choose 
jie a 
without loss of generality. We therefore look for a metric satisfying 
Py ay 


so G ~ |y,97] is antihermitian with respect to Y. For SO(D), we have simply 


BR 
II 
— 


since the hermiticity of the 0 matrices implies that of the y matrices. We then can also 
define a metric to raise and lower indices in terms of these two metrics, by contracting 


the dotted (or undotted) indices: In matrix notation, we then have 
(TN ey) als = CHC ee) 


For all cases except SO(4m), this also defines the symmetry properties of the gener- 
alized o matrices: They can be defined as CTY y for SO(2n+1), and as its diagonal 
blocks with respect to [7/4 for SO(4m+2); but for SO(4m) it’s off-diagonal, so the 


generalized 0 matrices appearing there carry one each of the two different kinds of 


spinor indices, and thus have no symmetry. Then we rewrite the above result as 
(CTY)? = (1) VP(CTYy) 
2. Wick rotation 


The indefinite-metric groups SO(D,,D_) (D, 4 0 4 D_) can be treated by 


Wick rotation: giving z’s to D_ of the y,q’s, so the corresponding components of 
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Nab get minus signs. Since this affects y7 in the same way as 7, the metric C7Y is 


unchanged. In other words, 
OvT=C. (eS) 


in terms of the Euclidean C of the previous subsection. However, y* and y' are 
affected in the opposite way to y and y7 (—i’s instead of 7’s). Y then becomes (up 


to normalization) the product of the timelike 7’s, 


r~ [J v2% 


Naa<O 


which also determines the modification of C. In the above equations for y' and y7, 
we then find a factor of —1 for each rotated dimension, coming from anticommutation 
with each timelike y in Y, so we redefine all y’s by an overall factor of i?- to preserve 


their pseudohermiticity. This changes the normalization to 


Goa =(-1)? sla %)l, {Yar WP = (—1)? “Nas 


In odd dimensions the o matrices are the y matrices (up to multiplication by one 
of the metrics), while in even dimensions the y matrices consist of two off-diagonal 
blocks of the o matrices. To write actions we also need the “dual” Dirac spinor, in 


the sense of a Hilbert-space inner product, 
w= wy 


with Y as defined above. In particular, it is just 27 in D_ = 1. 


We then find (e.g., using the explicit representation given above) that C’ has the 
same properties with regard to symmetry and 7_; for SO(D,,D_) as for SO(D, — 1, 
D_-— 1). Thus, the properties of these metrics on the irreducible (as opposed to 


Dirac) spinors follow easily from the Euclidean case by using 
Crs SO(D,,D_) — SO(D, + D_) 


Ct £00. D)=600.205 


Then the properties of Y follow from the above two in the cases where all 3 exist; 
the few cases where only Y exists, which have D even, follow from the next higher D 
(increasing D, by 1). 

Exercise XC2.1 


Find the explicit y matrices for D = 2 and 4 from the construction of the 
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previous section, and apply this Wick rotation. Compare the results with the 
conventions of subsections VIIB5 and ITA6. 


We can use these properties to determine that the number of real components D’ 


of an irreducible spinor is 


xmod8& |0 1 
Di gIP-eti@ pe f(x) 01 2 


The complete results can be summarized by the following table, showing for each 
case of SO(D — D_,D_), for D mod 8 and D_ mod 4, the types of irreducible spinors 
w, the types of metrics 7 (symmetric) and (2 (antisymmetric) for these irreducible 


spinors, and the type of generalized o matrices (and its symmetry, where relevant): 
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Exercise XC2.2 
The vectors of SO(D,,D_) for 3< D, + D_ <6 were expressed as tensors 
with two spinor indices, with the appropriate symmetry and tracelessness 


conditions, in subsection IC5. 


a Show that the spinor metrics found from the Dirac analysis are sufficient to 


identify each of these covering groups. 


b Show the equivalence of each orthogonal group to its covering group by show- 
ing that (1) the Lie algebras have the same dimension, and (2) the deter- 
minant (or its square root) of this tensor gives the appropriate orthogonal 
metric. (Hint: Do the cases D, + D_ = 4 and 6 first, and specialize to 3 and 
DB.) 


Exercise XC2.3 
Consider the groups SO(n,n) and SO(n+1,n). Explicitly construct a real 
representation of the y matrices by modifying the method of subsection XC1, 


demonstrating that all such spinors are real. 


Besides the Dirac spinors and Dirac matrices y, and the irreducible spinors and 
Pauli matrices a, it is also useful to introduce irreducible real (“Majorana”) spinors 
and corresponding matrices J’. When the irreducible spinors are already real these 
are the same, but when the irreducible spinors are complex this real spinor is just the 
direct sum of the irreducible spinor and its complex conjugate, a spinor with twice as 
many components. In general, these generalized Majorana spinors and matrices have 


many properties that are independent of the number of dimensions, but depend on 


the number of time dimensions: 


Tt action | toms feotmat |? 


For D odd, there is only one irreducible spinor, so there is a metric M°? or M°” 
to relate the two spinors listed. For D — 2D_ twice odd (2 mod 4), the original 
irreducible spinor was complex, so there is a metric representing a U(1) generator 
that rotates the complex spinor and its complex conjugate oppositely. (L.e., it’s the 
identity on the complex spinor and minus the identity for the complex conjugate.) 
For D — 2D_ = 3,4,5 mod 8, the original spinor was pseudoreal, and this U(1) 


can be extended to an SU(2): Since the complex spinor and its complex conjugate 
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transform the same way under the orthogonal group, they can be paired as a doublet 
of SU(2). This doubled representation is a real representation of the orthogonal 
group ® SU(2), since the direct product of the two antisymmetric charge-conjugation 


matrices is symmetric (two —’s under transposition). 


3. Other spins 


Before considering supersymmetry in higher dimensions, we first study represen- 
tations of the Poincaré group there. From the general analysis of section IIB, we 
know that general on-shell representations follow from the massless ones, which can 
be classified by their representation of the lightcone little group SO(D—2). Specifi- 
cally, the bosons can be described as traceless tensors of a certain symmetry (labeled 
by a Young tableau), while the fermions can be labeled as the direct product of such 
tensors with an irreducible spinor, with a tracelessness condition imposed between 
any vector index and the spinor index using a y or g matrix. Similar methods can be 
used to find the off-shell representations in terms of representations of SO(D—1,1), 
but without subtracting traces. (For full details, see chapter XII.) The gauge degrees 
of freedom can be subtracted from these Lorentz representations by dropping all lower 
vector indices with the value “—”, by the usual lightcone gauge condition; this tells 
us the number of total physical + auxiliary degrees of freedom. 

In practice, the only interesting massless fields in higher dimensions are: 

(1) the metric (graviton), 

(2) totally antisymmetric tensors (including scalars and vectors), 
(3) spin-3/2 (gravitino), desribed by vector®spinor, and 

(4) spinors. 


By the methods described above, the counting of physical, auxiliary, and gauge de- 


grees of freedom for these fields is (where D’ is the number of components of an 


irreducible spinor of SO(D—1,1) — see the previous subsection): 


0 
1p'(D +1) 
xD! 
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Exercise XC3.1 
Derive all the entries in the table. For each type of field, find the minimum 
D for which physical degrees exist. 


We next consider exactly how many higher dimensions are relevant. From the 
previous subsection, we see that an irreducible spinor (which we use for the super- 
symmetry generators) has 1 component in D=2, 2 in D=3, 4 in D=4, 8 in D= 5 or 
6, 16 in D=7, 8, 9, or 10, 32 in D=11, etc. Since the maximal Lorentz symmetry can 
be obtained by looking at the maximum D for which a certain size spinor exists, we 
see that the appropriate D for which an irreducible spinor reduces to N irreducible 


spinors (for N-extended supersymmetry) in D=4 is 


etc. From the discussion of subsection IIC5, we know that supergravity exists only 
for N<8, and super Yang-Mills only for N<4. This means that simple supergravity 
(i.e., any supergravity) exists only for D<11, and simple super Yang-Mills for D<10. 
Since theories with massless states of spin>2 are not of physical interest (in fact, no 
interacting examples have been constructed), we can restrict ourselves to looking at 
just D=4, 6, 10, and 11. In general, an irreducible multiplet in some D can become 
reducible in lower D. However, since irreducible multiplets of supersymmetry are con- 
structed as the direct product of the smallest representation of supersymmetry with 
an arbitrary representation of the Poincaré group, this reducibility corresponds di- 
rectly to the reducibility of that Poincaré representation, which occurs simply because 
the Lorentz group gets smaller upon reduction. In particular, the smallest represen- 
tation of supersymmetry is itself irreducible. For the case of simple supersymmetry, 
this is the scalar multiplet (scalars and spinors) in D=6, the vector multiplet (super 
Yang-Mills: vectors, spinors, and scalars) in D=10, and supergravity in D=11. The 
statement that it is the smallest multiplet in that number of dimensions is directly 


related to the fact that it does not exist in higher dimensions. 


A. Supersymmetry 


We first generalize to arbitrary dimensions some definitions used earlier: To dis- 


cuss the properties of supersymmetry that are common to all dimensions (but one 
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time), it’s most convenient to use the Majorana form 


{da, Ia} = TypPa 


which is consistent with the general symmetry of these matrices. The supersymmetry 
generators are then 


7) 4 9 
fa = 190 + aoe ae 


and €“qq generates the infinitesimal transformations 


b0% = &®, bx" = 15 51S 3° 6° 


where (qq)! = —qq. The covariant derivatives are 
0 1 pra 
da = a + $1530" Da 


and they satisfy the same algebra as supersymmetry 


{da, da} = PyigPa 


but with the opposite hermiticity condition (d,)' = +d,. The invariant infinitesimals 
are 


do”, dx* + i5(d0°)I2,0° 
Superfields can be expanded as either 
P(x, 8) = O(x) + Mala) + 


or 


giving the transformations 
6b = €°, 6%, = ie SIO. + ..., 


Representations can be found as for D=4; we don’t have twistors in general, but 
we can always use a lightcone frame. We first need to define °°’, which in general 
is independent of I%, (only the latter was needed to define supersymmetry above): 
The analog of the Dirac anticommutation relations (which can be reconstructed if we 


combine the two Is, as generalized o’s, to form a generalized y) is 
(a pxb)yB _ , ab sB 
‘ied i 2 = 7) On 


In the lightcone frame the momentum is just p* = 67 p* with pt = +1 being the sign 


of the (canonical) energy. In this frame we have the constraint [’~q = 0. This projects 
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away half the q’s, since —I‘*I"* are projection operators: Using the anticommutation 


relations of I=, 


Ti. = —r=rF = I,” = 14, HI = Toi. =O, fi a SI 


The equality of the sizes of the two subspaces follows from parity symmetry, [~ < 


I’*. We thus need to consider only half of the q’s, namely tq. We therefore switch 
to a notation where we consider the truncated spinor q, with just that half of the 
components. This “lightcone spinor” is an irreducible spinor of SO(D—2). In a 


Majorana basis it satisfies the same commutation relations as Dirac matrices, 


{Ops qv} = Oa 


Since q, has an even number of components (2”, n > 0) in D > 3, the states that 
represent this algebra form a Dirac spinor of SO(2") that is reducible to two Weyl 
spinors. (These spinors should not be confused with those of SO(D—2), such as q,, 
which is a vector of this SO(2”).) Since supersymmetry takes each of these “spinors” 
into the other, one spinor contains all the bosons, while the other contains all the 
fermions. There are an equal number of physical boson and fermion states because 
the two Weyl spinors are equal in size. Since SO(D—2)CSO(2”), each Weyl spinor 
of SO(2”) is reducible with respect to SO(D—2). The only exceptions are (1) D=4, 
where SO(D—2)=SO(2”)=SO(2), and there is one bosonic state and one fermionic 
one, and (2) D=10, where SO(D—2)=SO(2")=SO(8). 
Exercise XC4.1 


Let’s look more closely at these exceptions: 

a Show that SO(D—2)=SO(2") only in D=3,4,6,10. 

b Show that in D=6 the bosons form a reducible representation of the little 
group SO(D—2). How is this possible, when the group SO(2”) is the same? 

c For D=10, what representations of the little group are the bosons and the 
fermions? Compare this to the representations of SO(2”) formed by the 


bosons, fermions, and q itself, and apply this “symmetry” to the cases D=4,6. 


This “Dirac spinor” of SO(2") is the smallest representation of supersymmetry. 
It can also be represented in terms of anticommuting coordinates, by dividing up q,, 
into two halves, one of which is complex coordinates, the other half being both the 
complex and canonical conjugate (as for the fermionic harmonic oscillators of exercise 
1A2.3). The most general representation of supersymmetry is then the direct product 


of this one with an arbitrary representation of the Poincaré group. 
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All the results of this section can be extended to “extended supersymmetry” , 
with supersymmetry generators qj. for an N-valued “internal” index 2, as expected 
from our discussion of supergroups in subsection IIC4: For example, in D=4 the 
supergroup describing extended conformal supersymmetry, SU(2,2|N), includes con- 
formal symmetry SU(2,2), internal symmetry U(N), N supersymmetries, and N S- 


supersymmetries. In general, the supersymmetries then satisfy the algbera 
{ies qa} = dij SaPa 


The smallest representation of an extended supersymmetry follows as before, where 
now the complete lightcone q acts as Dirac matrices for SO(N2”). Other representa- 
tions are again found by direct product, now between this smallest supersymmetry 
representation and an arbitrary representation of both Poincaré and the internal sym- 
metry. For the more interesting cases, where N itself is a power of 2, the smallest 
representation can also be derived by dimensional reduction from higher dimensions 
of N=1 (“simple”) supersymmetry, changing the higher-dimensional algebra only by 
setting some components of the momentum to vanish, and noting that a spinor of 
higher dimensions reduces to many spinors, as clear from our explicit construction 
earlier. (Other representations tend to be reducible, since the Poincaré representation 
in the direct product is reducible upon dimensional reduction.) Dimensional reduc- 
tion can also be defined for an action (for supersymmetric or nonsupersymmetric 
theories), by again setting the derivatives with respect to the “extra” coordinates to 
vanish, and also restricting the integration to the reduced set of coordinates. An- 
other interpretation is that we expand the fields over all momentum modes in the 


extra coordinates, and then drop all but the zero (constant) modes. 


We also recall from subsection XC2 the index structure of spinors in D=6, 10, 
and 11, which we need to write supersymmetry covariant derivatives. We thus have, 


for simple supersymmetry, 


D=6: 1 aes dg} = —Cit0o8 
D=10: {do,dg} = —0%gid, 
D=11: {da, dg} = —05810. 


where in the case of D=6 we have taken advantage of the fact that SO(5,1)=SU*(4) 
to eliminate vector indices, and introduced the SU(2) index 7 for spinors to make 


them Majorana. 
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5. Theories 


We first consider the scalar multiplet in D=6. The constraints and field equations 
are given by the statement, in terms of supersymmetry covariant derivatives, that 
there are only scalars and spinors on shell, and by supersymmetry their physical 
polarizations must be equal in number. Since a spinor has 4 polarizations in D=6, 


we must have 4 real scalars, and thus 
diahjk! = Chika 


The second SU(2) index k’ is introduced again to make a spinor (this time the field) 
Majorana, and performs a similar service for the scalars. This one equation is suf- 
ficient to completely describe this multiplet on shell in the free case; interactions 
require derivatives, so we won’t consider them here. This multiplet reduces to N=2 
in D=4 in a very simple way: The SU(2) index on d labels the 2 supersymmetries, 
and the 4-component spinor index reduces in the obvious way to SL(2,C) indices, 


a — (a, a), with appropriate 6D spinor conventions. 


Exercise XC5.1 
Show the equations given for the 6D scalar multiplet give the complete field 
equations for all the components, and that only the scalars and spinors shown 


explicitly in that equation survive on shell. 


This six-dimensional theory gives a simple example of nontrivial dimensional re- 
duction: Assume we have a 5-dimensional theory with a nontrivial U(1) symmetry. 
Then we can dimensionally reduce by choosing the fields to depend on the fifth coor- 
dinate in such a way that the fifth component of the momentum of each field is equal 


to a constant m (with dimensions of mass) times its U(1) charge Q: 
pa = Z=mQ 


This is consistent at the interacting level because each term in the action satsifies 
conservation of the U(1) charge as well as conservation of momentum. This is equiv- 
alent to how we introduced masses by dimensional reduction in subsection IIB4 for 
free fields, since any free field can be “complexified”. This has an interesting effect 
on the supersymmetry algebra: It introduces a U(1) charge Z (called “central” be- 
cause it commutes with the rest of the algebra). For example, if we start with the 
6D supersymmetry algebra (like the above algebra for the supersymmetry covariant 


derivatives), introduce the central charge in reducing to 5, and then do an ordinary 
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reduction to 4 (or vice versa), the supersymmetry algebra becomes (see subsection 
IVC7) 


1 Gia 7} = 5D: {diay G38} = CapCizZ, {q, 7} = CyyCP%Z 


If the higher-dimensional theory was massless, then p? + Z? = 0 for the 4D theory. 
More generally, if the higher-dimensional theory already had masses before the central 
charge was introduced, then by supersymmetry it satisfied p? + Mé = 0, Mé > 0 
(since supersymmetry always has positive potentials), while afterwards the 4D theory 
satisfies 

p+Z+Me=0 => M=M24+7>7? 


where M is the 4D mass, in terms of the higher-D mass Mp. However, in general, in 
the absence of central charges, massive representations of supersymmetry are bigger 
than massless ones (because there are twice as many independent supersymmetry 
generators on shell, since q is a spinor with 1 helicity for the massless case, but an 
SU(2) doublet for the massive). So, M? = Z? > 0 has the advantage of allowing 
smaller massive representations than when M? > Z? = 0 or when M2? > Z? > 0. 
Note that when M? = Z?, so all masses arise from the central charge, (total) mass is 
conserved, just as in nonrelativistic physics, although in the relativistic case the mass 
Z can be negative. (Of course, its square is always positive, as is physical energy. The 
relation between the relativistic and nonrelativistic cases can be understood through 
dimensional reduction: See exercise [A4.5. The mass is also a central charge for the 


Galilean group, but there the reduction is for a lightlike dimension.) 


In the present case, we can choose our U(1) symmetry to be a subgroup of the 
extra SU(2) internal symmetry (k’ index) of the 6D scalar multiplet. Note that the 
algebra of the d’s is modified in the same way as that of the q’s. 


Super Yang-Mills is a bit more interesting, because interactions are easier to 
introduce. From the counting arguments given in subsection XC3, we see that a 
supersymmetric theory consisting of 1 vector and 1 spinor can exist in D=3, 4, 6, 
or 10. This corresponds directly with our analysis of the largest dimensions for 
simple supersymmetries: Dimensional reduction of a vector gives also scalars, so the 
condition of no scalars gives maximum dimensions. We now make an analysis similar 
to that of the previous subsection: By dimensional analysis for physical fields, and 


using single-Majorana-spinor-index notation, 
{Va, Va} = —LatV a 


[Va Val _ PoggW? 
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Mas Vo = aans 


Applying the Jacobi (Bianchi) identities, we find 
Puapl ys = 0 


This identity can be satisfied only in D=8, 4, 6, or 10. The Bianchi identities imply 
the field equations for D=10. 


Exercise XC5.2 
Multiply the identity Ioagl me = 8 by 1°°°9, and use the I’ matrix anticom- 
mutation relation ree = n”6% to show that D=3, 4, 6, or 10. 


Similar methods can be applied to D=11 supergravity. Our component counting 
for general dimensions, and our helicity analysis for general extended supersymmet- 
ric theories in D=4 (applied to the dimensionally reduced theory), can be satisfied 
by adding to the metric (44 physical components) and gravitino (128) a third-rank 
antisymmetric tensor gauge field (84) Amnp (with field strength Frinpg = ZO(mAnpa)): 
The action for the graviton and gravitino are like those in 4D N=1, while A has not 


only the obvious quadratic term but also a “Chern-Simons term”: 


L = e'[-iR sc Wm VrPV nVp as 56 (Fabea)” Si wr + w] 


1 mnpaqr stuvwZ 
+ aaa E Anal antl was 


(There are also more-complicated fermion interaction terms than in 4D N=1.) The 
necessity of the last term can be shown by finding the component form of the super- 
symmetry transformations, or by finding the field equations implied by the superspace 


formulation. 


6. Reduction to D=4 


We now look instead at the component formulation of higher-dimensional super 
Yang-Mills. This formulation is off shell except for the lack of auxiliary fields. Since 
the fields are just a vector and a spinor, the Lagrangian consists of just that of super 
Yang-Mills coupled to a spinor in the adjoint representation of the Yang-Mills group. 
Upon dimensional reduction, the vector produces some scalars. For example, the 
D=10 theory has an SO(9,1) symmetry, which reduces in D=4 to the SO(3,1)@SO(6) 
subgroup. The SO(6) symmetry of the 6 flattened dimensions is the SU(4) symmetry 
of the N=4 supersymmetries. Under this reduction, the vector becomes 10 — (4,1) @ 
(1,6), namely a 4-vector and scalars that form a 6 of SU(4), while the spinor becomes 


16 — (4,4), a 4D spinor that is also a 4 of SU(4) (like the supersymmetry generators). 
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Although y (or o, or I’) matrices are necessary in D=10, in D=4 we can convert 
to spinor notation for both SO(3,1) (=SL(2,C)) and SO(6) (=SU(4)). Thus vectors 


and the Minkowski metric reduce as 
Ves (VV); (VMS TR RV, (VY) = Hy = deu™ 
Nak > (CapC x3, €ijhl) V- Wow. + sV"W i; 


while spinors and Pauli matrices reduce as 


pie iCoaViz OV 2 
o+~5/( 1, av, et ae se 
Y amd 5; ( 5 Fog — 5'Visg iCy,V9 


DBVX > Vaak bieKs® — x8h:*) + Li(Viyh a + VIDE Ka) 


The two terms in the 10D Lagrangian then reduce as 


iF? + s[V, dis . ky oe") — aa lbis, dua] [o%, oe") 
Bie [iV aa VO] + gt( (Hig, V2] + Bi*[6", dja) 


Exercise XC6.1 
Looking at the SU(3) subgroup of SU(4), decompose the states of N=4 super 
Yang-Mills into those of N=3. (Use the analysis of subsection IIC5 to count 
states, in SU(N) representations.) Do the same to decompose N=4 into N=2 
super Yang-Mills plus scalar multiplet, this time using the SU(2)@SU(2) sub- 
group for which 4 — (3,0) @ (0,4) (ie., i (i,7’)). This is another way of 
understanding where the second SU(2) of the scalar multiplet comes from. 
Exercise XC6.2 
Derive the commutation relations of the N=4 Yang-Mills covariant derivatives 
of subsection IVC7 by dimensional reduction of those for 10D N=1 given in 
the previous subsection. (Don’t forget the scalars come from the components 


of the vector covariant derivative in the extra dimensions.) 


Dimensional reduction of (super)gravity is an example of the comparative sim- 
plicity of the vierbein (covariant derivative) formalism vs. the metric or even inverse 
vierbein (differential form) formalisms. The reason in this case is that gravity is 
treated like Yang-Mills theory, and gauge vectors result from reducing the graviton. 


This is seen most easily from comparison of the coordinate transformation laws: 
m n m nm m 
Cbs” SA One, =," aa 


OGg =A One. Ey Oma” 
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OO _ rOnJmn + GotmOn AP 


Fixing the index m = —1 on X” to get the gauge transformations of an Abelian vector 
resulting from reduction from one extra dimension, and setting 0_, = 0 when acting 
on any field as the definition of reduction, we see the identification (in an appropriate 
gauge for the SO(D,1)/SO(D—1,1) generators M_,,) 


m a es” Ay 
€a => 
= ( ne 


where A transforms in the usual way for a gauge vector, and w is an additional scalar. 


A more transparent way to write this is as 
N= XN" Om; €q = €a” Om; d€, = [A, ea] 


A— A os 710.4, €a (65 = AO 4, wo_,) 


which makes it clear that reduction has simply U(1)-covariantized the gauge param- 
eter, transformation, and field, where 0_, is the U(1) generator. (Under reduction all 
fields are U(1) neutral.) On the other hand, the reduction of e,,“, being the inverse 
of e,'", and gmn, being the square of that, yields nonlinear reductions, and the U(1) 
covariantization is not manifest. (In particular, in the metric formalism the metric, 
and thus the U(1) vector, does not even appear in the covariant derivative, except in 


terms with its derivatives. ) 


Exercise XC6.3 


Derive the result of exercise [XC1.1 by dimensional reduction. 


Exercise XC6.4 
Let’s work out the details of this simple example, reduction of pure gravity 


from one extra dimension: 


a Find the reduction of c,,° by examining the commutators of the reduced e,. 
(Fp comes out directly.) Using the expression of the Lagrangian in terms of 
the c’s from exercise IX A5.2, find the reduced action, including a cosmological 
term. (Drop the f da~'. We can think of this as compactification on a circle, 
independence from x~! yielding a constant factor upon integration, which can 
be absorbed.) 


b The scalar appears in a funny way, seen previously in subsection IXB5. Rather 
than field redefinitions, it is more convenient to reintroduce local scale invari- 


ance (after the reduction), as in subsection IXB5, introducing the dilaton @. 
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Then make a simple redefinition that replaces w and @ with the “canonical” 


fields ¢4. (The F? and cosmological terms then appear with powers of d+.) 


Reduction from one extra dimension can give only a single (Abelian) gauge vector, 
but two or more dimensions can yield nonabelian gauge groups as the spacetime 
symmetries of the compactified dimensions. For example, compactifying n extra 
dimensions into an n-sphere gives SO(n+1). (However, compactifying to a box with 
periodic boundary conditions gives an Abelian group again.) The generalization is 
then 

) — DEMG:, €g > (€g+ Ag’ Gy, ¥;'Gz) 


where the only dependence on the extra dimensions is implicit in the group generators 
G,. If we add matter fields (before reduction), then the fields can be constrained to 
be independent of the extra dimensions (i.e., singlets of Gy) when their indices are 
flat. 


Another possible modification is to make the action of the generators on matter 
fields nontrivial. If we already have an internal symmetry group, with generators G I; 


identical to that of the G;, then we can impose on all matter fields ¢ 
Gio = Gyo 


to determine their dependence on the extra coordinates. The fact that the original 
higher-dimensional action was invariant under the G, guarantees that the resultant 
dependence on the extra coordinates will cancel. The simplest example was applied 


to supersymmetry in the previous subsection: In the Abelian case we can set 
~i0_1¢ =mG¢ 


where we are free to scale the Abelian generator by a mass parameter m (unlike the 
nonabelian case, where it would change the algebra). 


Similar results can be obtained for supergravity, but the results are more com- 
plicated, because the scalars (which appear for N>3) appear in nonlinear 0 models. 
Furthermore, although these models can be constructed by the coset method dis- 
cussed in subsection IVA3, the coset space G/H is noncompact, because the group G 
is noncompact, although the subgroup H is compact. This is a consequence of the fact 
that the “compensating” scalars of the group H=U(N) (or SU(8) for N=8) appear 
with the wrong-sign kinetic term (as the dilaton even in ordinary gravity). Thus, con- 
formal supergravity is coupled to “matter” with scalars in the adjoint representation 
of the noncompact group G, while gauging away the compensating scalars leaves the 


physical scalars of the coset space G/H. A simpler analog is N=1 supergravity coupled 
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to a scalar multiplet (see subsection XB7). This is the same as conformal supergravity 
coupled to the matter action ¢¢ — Yy, which has a symmetry G=U(1,1), while N=1 
supergravity has a gauge group U(1). Including Weyl scale invariance GL(1), the 
physical scalars then inhabit the coset space U(1,1)/U(1)®GL(1)=SU(1,1)/GL(1). 


In the case of extended supergravity, the group G can be found by noting that 
the physical scalars parametrizing G/H form the representation $/*! (totally anti- 
symmetric, and complex conjugate) of the group H. We then look for the group G 
whose adjoint representation transforms under the H subgroup as these scalars + ad- 
joint of H. We can also determine G by defining group generators for G as M,/ for H, 
and M;;,. (and hermitian conjugate mM") for G/H, and write commutation relations 
consistent with covariance under H. For N=8 we also have MW!" = eo nd Minna 
(and the same for the corresponding physical scalars). The result for the coset space 
G/H is 


. SU(4) @ SU(1,1)/U(4) = SU(1,1)/U(1) 
 SU(5,1)/U(5) 

- $O*(12)/U(6) 

: Ey47)/SU(8) 


co DM Ot ODS 


where E77) is a noncompact form (Wick rotation) of the exceptional group E7. 


An additional complication is that the vectors represent the full H symmetry 
only on shell. For example, for N=2 we have a single vector, as in electromagnetism. 
Maxwell’s equations without sources have a U(1) symmetry, “S-duality”, that trans- 
forms fag by a phase (and ie by the opposite), that mixes the field equations with 
the Bianchi identities. With sources, it mixes electric and magnetic charge, since it 
mixes electric and magnetic fields. So, in general we must introduce both electric 
and magnetic potentials for each vector. Furthermore, for N=6 the vectors appear 
as both ee and fijkimna3 (one extra vector). (For N=8, the two are related by an 
€ tensor, just as ¢ and ¢.) In this version of extended supergravity, all the vectors 
are Abelian. There is also a version where they gauge SO(N), but that theory has a 


cosmological constant. 
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XI. STRINGS 


There are three areas of application of QCD, as defined by the region of momen- 
tum space they address: (1) One is perturbative QCD, which applies to large relative, 
“transverse” velocity of (some of) the constituents of the hadrons. In this approach 
such an amplitude is divided into a half consisting of high-energy, asymptotically-free 
partons, which is calculated perturbatively in the gauge coupling, and a half con- 
sisting of low-energy, confined partons, which is nonperturbative, and therefore not 


calculated. 


(2) Another area deals with the low-energy behavior of QCD — with properties 
of the vacuum (e.g., broken chiral symmetry), or the lowest-mass hadrons, scattering 
at small relative velocities. This approach is nonperturbative with respect to the 
gauge coupling, and instead perturbs in derivatives, as in first-quantized JWKB. The 
methods used include instantons, lattice QCD, current algebra, dispersion relations, 
nonlinear o models, and duality. This low-energy behavior really says nothing about 
confinement, just as the low-energy states of the hydrogen atom tell us nothing about 


ionization. 


A closely related problem is that the nonperturbative information about QCD 
that comes from (electromagnetic-type) duality considerations, which relates “weak” 
coupling to “strong” coupling as g  1/g, is not really relating quark-gluon physics 
to hadronic physics, but is relating quark-gluon physics to monopole physics; i.e., it 
relates a description of weakly coupled “electric” color charges to a similar looking 
theory of weakly coupled “magnetic” color charges. Thus, the dual theory, being 
formally of the same type as the original, except for a relabeling of what is called 
“electric” and what is called “magnetic”, does not give anything that looks any more 


like hadrons, or make it any easier to calculate. 


(3) The one nonperturbative approach that does deal with high (hadron) energies 
is string theory: It incorporates hadrons of arbitrarily high mass, and studies their 
scattering at high energies. It also shows that stringy (hadron-like) behavior is a 
characteristic of QCD coupling g = 1, while g = 0 or oo have non-stringy (parton- 
like) behavior: String perturbation expands in G = In g, not g nor 1/g; duality is the 
symmetry G «+ —G. Furthermore, this G is the coupling that defines the free string, 
i.e., how partons bind to form strings. The coupling that determines how hadrons 
couple to each other is 1/N,, as described topologically in subsection VC9. (However, 
the relation of duality to the 1/N, expansion is unclear, since duality has been studied 


so far only in relation to theories where the group is spontaneously broken to U(1), 
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so effectively N. = 1, or with respect to instantons, which are always defined for 
SU(2) subgroups, so effectively N. = 2. Thus, it has not been possible to apply such 
duality arguments simultaneously with a 1/N, analysis. Similarly, since both these 
duality approaches deal only with low-energy behavior, they are difficult to relate to 


confinement. ) 


Most of the research effort on string theory has been directed toward models 
with “critical dimension” D>>4 (10, 11, or 26): To describe physics in the real world 
of D=4, it is usually assumed that the extra dimensions choose to “compactify” 
to submicroscopic dimensions, corresponding to length scales well below the range of 
present experiments. (The extra dimensions cannot be completely eliminated without 
losing renormalizability.) Such a solution to the classical field equations with min- 
imal energy is chosen as the “vacuum”, about which perturbations are performed, 
but nothing is known to preclude contributions to the functional integral from other 
vacuua, whether 4-dimensional, 10-dimensional, or elsewhere. Moreover, although 
superstrings have been chosen to describe quantum gravity because of their renor- 
malizability (finiteness), this advantage is lost after compactification, since the ar- 
bitrariness in choice of compactification is tantamount to the loss of predictability 
in nonrenormalizable theories. Furthermore, D=10 superstring theories are (com- 
pactifications of) D=11 membrane theories in disguise, where the eleventh dimension 
shows up only nonperturbatively. Not only is using a formalism where not all of the 
dimensions are manifest a technical obstacle, but the quantum mechanics of mem- 
branes suffers from several problems, including nonrenormalizability. This suggests 
that D=10 superstrings are nonrenormalizable at the nonperturbative level. On the 
other hand, renormalizabilty of theories with a finite number of fields predicts D=4, 
since theories in higher dimensions are all nonrenormalizable (or have unbounded 
potentials: $° theory). Furthermore, both experiments with hadrons and theoreti- 
cal arguments in QCD suggest the existence of an inherently 4D string theory. (For 
example, the existence of a continuum limit for confining spacetime-lattice theories 


requires asymptotic freedom.) 


However, historically the true usefulness of such string theories has been for the 
concepts and features of field theory they have revealed: For example, supersymmetry 
(sections HC and IVC, and chapter X), the Gervais-Neveu gauge (subsection VIB4), 
topological (1/N) expansion (subsection VIIC4), first-quantized BRST approach to 
gauge theory (chapter XII), and certain simplifications in one-loop amplitudes were 
all discovered through studies of 10- and 26-dimensional string theory, even though 
they all are now understood more simply through ordinary field theory. This is due 


to the fact that string theories are so complex and restrictive that they require the 
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most powerful techniques available. Clearly such strings are useful toy models for 
learning about particle field theories, and about general properties of string theory 
that might lead to generalizations to include realistic 4-dimensional string theories. 
(In fact, the first paper on string theory was written in 1747 by d’Alembert, and was 
the first appearance of the wave equation and the d’Alembertian. Thus, field theory, 


quantum mechanics, and special relativity can trace their origins to string theory.) 


In subsections IVB1 and VIIC4 we briefly discussed how hadrons are expected to 
arise as strings from QCD. In this chapter we analyze the dynamics of this mecha- 
nism. We begin by formulating the theory in terms of strings directly. Perturbative 
calculations are performed using first-quantized path integrals. (These methods are 
based on the corresponding ones for the particle from section IIIB and subsections 
VB1 and VIIC5, and massless 2D field theory in subsection VIIB5.) The only ex- 
perimental evidence for strings is as a description of hadrons; to some extent the way 
that QCD leads to strings can be understood with similar first-quantized methods, 


based on random lattices. 


aviaaeeesinanauesd A. GENERALITIES .......000000000....... 


In this section we examine some of the general properties of string theory, shared 
by all known models, but expected to apply also to more realistic strings. These 
features can be used for phenomenological applications of string theory, but also may 


help point to new generalizations. 


String theories are the only known theories that exhibit (S-matrix) duality. Unlike 
other S-matrix approaches, they provide an explicit perturbative calculational scheme, 
like field theory, and string theory can be formulated as a field theory. Also like field 
theory, string theory has consistency conditions at the classical and quantum levels, 


related to gauge invariance and renormalizability. 


When string theory is used as a unified theory of gravity and other forces, the 
most interesting predictions are those for the “low-energy” (with respect to the Planck 
mass) part of the theory. Although the possible low-energy limits of known string 
theories have not all been explored, the indications are that there are only a few 


restrictions beyond the usual field theoretic ones: 


(1) In a term in the effective action, the power of the dilaton counts the number of 


loops, since the string coupling is the vacuum value of the dilaton. 


(2) The spectrum of the closed string is given by the direct product of two open 


strings. 
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(3) String theory has noncompact symmetries, “S-duality” and “T-duality”, resulting 


from the amplitudes also having this direct-product structure. 


All these properties survive the low-energy limit. In the supersymmetric case, the 
last property follows from the first two. (However, the D=10 superstring is actually 
a D=11 supermembrane nonperturbatively. Since the observed features of hadrons, 
as well as qualitative arguments from QCD, indicate stringy but not membrane- 
like behavior, we will abstract only the perturbative features of higher-dimensional 


strings.) 


1. Regge theory 


In principle there is no difference between a fundamental state and a bound 
state: We can always write an action with every state represented by an independent 
field. Of course, such an action might not be renormalizable, but that seems more 
of a formal distinction. A more physical one is based on the qualitative property 
that bound states have radial and other excitations with related properties, while 


fundamental states are more unique. 


Regge theory is an approach to bound states that treats them as fundamental. A 
family of states that are different excitations of the same ground state is treated as a 
single entity. Although basically an approach based on fundamental properties of the 


S-matrix, when combined with perturbation theory it leads directly to string theory. 


A quantitative definition of this concept follows from a generalization of a concept 
seen in perturbative field theory. In amplitudes following from Feynman diagrams 
the nature of intermediate states can be seen from the momentum-space behavior: 
Single-particle states appear as poles (in the sense of complex analysis) in some mo- 
mentum invariants, 1/(p? + m?), where this p is the sum of some of the external 
momenta, representing the momentum of the internal state. (Any tree graph is a 
simple example.) Two-particle states appear as cuts in these invariants, where the 
branch point represents the state where the two particles are at rest with respect to 
one another, and the rest of the cut corresponds to arbitrary relative velocities. (For 
example, a one-loop propagator correction has a branch point at —p? = (m, + mz)? 
for intermediate particles of masses m, and m2.) Similar remarks apply to other 
multi-particle states. “Analytic S-matrix theory” was an attempt to formulate par- 
ticle physics in terms of the S-matrix by replacing the property of locality of the 
action with “maximal” analyticity of the S-matrix in momentum space. (Of course, 
unitarity and Poincaré invariance can be described easily in terms of the S-matrix; 


even analogs of renormalizability can be formulated in terms of certain properties of 
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the high-energy behavior.) Unfortunately, the most general form of such nonanalytic 
behavior (poles, cuts, etc.), as discovered from analyzing Feynman diagrams, proved 


to be too complicated to provide a practical method for defining a theory. 


t 


Ss 


Since Poincaré invariance means that not only momentum is conserved but also 
angular momentum, a natural next step was to consider the analytic behavior in that 
variable as well. This behavior is seen already in nonrelativistic theories; here we will 
approach the concept in a language most relevant to relativistic physics. The simplest 
example of “Regge behavior” is the 4-point S-matrix; this is the relativistic analog 
of a nonrelativistic particle in a potential. (We can think of an infinitely massive 
second particle as producing the potential, or separate center-of-mass and relative 
coordinates for two finite-mass particles.) Also, the Feynman diagrams that appear 
in the nonrelativistic problem are “ladder diagrams”: The sides of the ladder represent 
the two scattering particles, while the rungs represent a perturbation expansion for 
the potential. It can be shown that such diagrams give the leading behavior of this 
amplitude at high energies. Here the appropriate high energy limit is defined in terms 


of the Mandelstam variables (see subsection IA4); by high energy we mean, e.g., 
s—-—oo, t fixed 


We should really look at s — +oo0 for a physical amplitude, i.e., total center-of- 
mass energy — oo. But then we would run into the poles in that channel, from 
“annihilation”, for intermediate states of positive (mass)?, so instead we take s > 


—oo, which has a well-defined limit, and later analytically continue Re(s) — +00. 


The high-energy behavior of ladder diagrams can be shown to be of the form (in 


units of an appropriate mass) 
Aa(s,t) = kg’ P[-a(t)|(—s), a(t) =a+9°b(t) 


where a is a constant that describes the behavior of the tree graph, and b(t) is deter- 
mined by the one-loop graph, but gives the leading contribution of all the higher-loop 
graphs. (Both, and the constant k, are independent of g.) This amplitude takes a 
simple form under a modified type of “Sommerfeld-Watson transform”: 


co 


Aa(s,t) = fx P(—J)(—s)"Ag(J,t) = S> 8” Aa(J, t) 


277% 
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ly ©) a(t) 
rN 


The contour integral is taken as clockwise about the positive real axis to obtain the 
last form, where it picks up the poles of [’(—J) (see exercise VIIA2.3b), but can be 
deformed to surround the singularities of A,. In this case, a pole at J = a(t) in A, 
will reproduce our original ladder amplitude, while integrating it around the positive 


real axis gives 


A,(J,t) = ba = Ai(s,t) =k >> 4072 _ 
which shows that particles of spin J contribute simple poles in t to the amplitude 
at a(t) = J when a(t) can be approximated as linear near that value. The spin of 
the intermediate particle follows from the s’ factor. (This is clear from examining a 
4-point tree graph where the external lines are scalars and the internal line carries J 
indices, and must contract the momenta of its two ends. There are also contributions 
of lower spins from traces.) Thus the “Regge trajectory” a(t) determines not only 
the high-energy behavior of the amplitude (for negative t), but also the spins and 
masses of the bound states (for positive t): Looking at the graph for J = a(t), there 
is a bound state of spin J and mass /t whenever the curve crosses an integer value of 
J. The contribution at n loops in perturbation theory to A,(J,t) is a multiple pole 
(J —a)~(*»), which contributes to A,(s,t) a term proportional to (—s)*[In(—s)]". 
Exercise XIA1.1 

Calculate the tree scattering amplitude of two spinless particles of equal mass 

due to the exchange of a particle of spin J with coupling and propagator as 

given at the end of subsection IIIA4. Show that at the pole in t the leading 


contribution in s goes as s’. 


Exercise XIA1.2 
Consider the amplitude 


A dre™[f(r)/ 9, f(0) = 0, f'(0) 40 


where f is Taylor expandable. By expanding f, show that a sum of Regge 
amplitudes is obtained, where the “leading trajectory” is a(t), and there are 


“daughter trajectories” a(t) — n for positive integer n. 
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Exercise XIA1.3 
Consider the energy spectrum of the hydrogen atom (nonrelativistic, with 


spinless constituents). Show that this corresponds to a leading Regge trajec- 


x= B-1 


for some constant Eo, with daughter trajectories. 


tory of the form 


Unfortunately, for field theories with a finite number of fundamental particles, the 
trajectories are rather boring, containing only a finite number of bound states. In cer- 
tain cases a trajectory may include one of the fundamental particles itself (“Reggeiza- 
tion”). Because of the usual infrared divergences, such calculations can be applied 
directly to S-matrix elements only for fundamental massive particles; for fundamental 
massless particles, as in confining theories (like QCD), these results require external 
lines to be offshell, and some knowledge of the parton wave functions is needed. 
Regge behavior thus gives a measurable definition of confinement: If the scattering 
amplitudes of color-singlet states (or color-singlet channels of off-shell amplitudes of 
color-nonsinglet states) have linear trajectories, the constitutent color-nonsinglet par- 
ticles can be said to be “confined”. On the other hand, if the Regge trajectory rises 
only to finite spin and then falls, as with the Higgs effect, then there is only “color 
screening”; color-singlet states might not be observable, but we do not see the infinite 
number of radial excitations characteristic of confinement. Another possibility is that 
arbitrarily high spin is reached at finite energy: This is characteristic of Coloumb 


binding, and indicates that a new, “ionized” phase is reached above that energy. 


Coulomb 


confining 


Experimentally, hadrons are observed to have Regge behavior with respect to 
both high-energy behavior and spectrum. However, those Regge trajectories are 
approximately linear, thus indicating an (near) infinite number of bound states. The 


linearity of the trajectories can be shown to be related to the relative stability of these 
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unstable particles (as compared to what is found in ladder approximations). This 
suggests a formulation of the theory of hadrons where the whole Regge trajectory 
is treated as fundamental. It can be shown that in any such “Regge theory” based 
on a perturbation expansion where the “tree” graphs have only poles in the angular 
momentum J (whose accuracy is implied by the linearity of the observed trajectories), 
that the theory has a further property called Dolen-Horn-Schmid (s-t) “duality”: This 
property states that the amplitude can be expressed as a sum of poles in either the s 


or t “channel”, rather than as a sum over both: 


oo Gilt) — a Cn(s) = 
fe Drei > are x ><K =r 
This holds even when the sets of particles exchanged in the two channels are dif- 


ferent, due to quantum numbers of the external states. This relation has also been 


experimentally verified (approximately). 


Explicit realizations of such “dual models” of the S-matrix in terms of first- 
quantized systems are called “string theories”. They explain the linearity of the 
Regge trajectories by the harmonic-oscillator structure of the string Hamiltonian, 
and the duality of the amplitudes by the conformal invariance (“stretchiness” ) of the 


string worldsheet. 


2. Topology 


The defining concept of the string is that it is a two-dimensional object: Just as 
the particle is defined as a point object whose trajectory through spacetime is one- 
dimensional (a worldline), the string has as its trajectory a two-dimensional surface, 
the “worldsheet”. There are two types of free strings: open (two ends) and closed 


(no boundary). Their worldsheets are a rectangle and a tube (cylinder). 


Oo 


This leads to a much simpler picture of interactions for strings than for particles. 
For particles, one rarely uses first-quantization to describe self-interactions. Generally, 


relativistic quantum mechanics is limited to free particles, or particles in a fixed 
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background. On the other hand, the quantum mechanics of strings is often the best 


way to describe quantum strings perturbatively in the string coupling, for two reasons: 


(1) For interacting particles, the geometric picture of a worldline becomes a graph, 
whose geometry is not differentiable at the interaction points, where the curves split. 
For interacting strings, we have instead a differentiable surface (worldsheet) with non- 
trivial topology: sphere, disk, torus (doughnut), etc. The external states are described 
by boundaries that become disjoint at time t = +oo (not worldsheet parameter T). 
For example, a tree graph now looks more like a real tree, in that the branches now 


have thickness, and they join smoothly to the rest of the tree. 


(2) The quantum mechanics of strings is invariant under 2D conformal transfor- 
mations of the worldsheet. (But the quantum field theory isn’t conformal in space- 
time, because there is a discrete mass spectrum.) As a result, the worldsheet can be 
“stretched” to the extent that field theory tree diagrams are described by the same 


surfaces as propagators. 


The fact that the string worldsheet is described by conformal geometry rather 
than the usual geometry means that the worldsheet metric is reduced to just a few 
parameters (called by mathematicians “moduli” ), and topology (which doesn’t even 
require a metric). These parameters are similar to those that appear in Feynman 
diagrams for particles (so the 2D metric in some sense has been reduced to a 1D 


metric), but the topology of surfaces is much different from that of stick graphs. 


From this topological point of view, string diagrams are equivalent if they can 
be “stretched” into one another. An explicit way to show this is using Dolen-Horn- 
Schmid duality. We have mentioned in the special case of the 4-point amplitude that 
summing over poles in one channel is equivalent to summing in the other. This result 
can be generalized: We can write any string graph as an ordinary Feynman diagram 
with just cubic interactions, but with any 4-point subgraph satisfying duality. (This 
is not string field theory, whose graphs are not separately dual.) So we can use duality 
to relate graphs of the same 2D topology, and must not double-count by summing 


graphs that are topologically equivalent. 
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In particular, in any loop graph any loop can be moved anywhere, into a prop- 
agator or an external line, or can be pulled out to form a tadpole (string going into 
the vacuum). The result is that any graph is equivalent to a tree graph with in- 
sertions of some one-loop open- or closed-string tadpoles. However, this does not 
mean that any graph constructed with only open-string propagators and interactions 
can be expressed as an open-string tree graph with tadpole insertions: The one-loop 
open-string graph with two “half-twists” on the open-string propagators in the loop 
is equivalent to a tree graph with a closed-string intermediate state, as can be seen 
by stretching the surface, or by tracing the routes of the boundaries. (For example, 
drawing this graph in a psuedo-planar way, as a flat ring with external states con- 
nected to both the inner and outer edges, pulling the inner edge out of the plane 
reveals a closed string connecting the two edges.) This phenomenon is similar to 2D 
bosonization: A closed string can be represented as the “bound state” of two free 
open strings just as a massless scalar in D=2 can be represented as the bound state 


of two free massless spinors. 


There are only 3 types of 1-loop insertions to consider (and for “orientable” strings 
only 2): 
1) handle 
2) window (hole) 
3) cross-cap (like a nonorientable hole) 
In string theory the coupling is topological, in the sense that the power of the coupling 
constant is counted by (minus) the “genus” of the worldsheet, the “Euler number” 
x, given by the integral of the worldsheet curvature (see exercise IXA7.3). However, 


in counting string loops, the last two of the 3 listed above count as open-string loops, 


while the first counts as a closed-string loop, which is equivalent to 2 open-string 
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loops. Consequently, the closed-string coupling is the square of the open-string one: 
The Euler number is 
=2-—-2h-—w-c 


- > 


The cross-cap is a hole with opposite points identified: It thus actually does not 
introduce a boundary, but does introduce nonorientability. If the number of cross- 
caps is more than 2, it can be reduced to 1 or 2 by replacing pairs of cross-caps 
with handles. Notable examples of surfaces with cross-caps are the projective plane 
(sphere with 1 cross-cap), Klein bottle (sphere with 2 cross-caps), and Mobius strip 


(sphere with 1 cross-cap and 1 window). 


cross — Cap 


Analyzing the 1-loop insertions as tadpoles also makes it easy to interpret di- 
vergences and how to renormalize them: Tadpoles contribute to vacuum values of 
scalars. In string theory, coupling constants are also vacuum values of scalars (the 
string coupling g from the vacuum value of the dilaton, the slope a’ of the string 
Regge trajectory from the vacuum value of the determinant of the metric tensor). 
Thus, string divergences correspond to renormalization of couplings. However, we 
know that divergences in quantum gravity can lead to difficulties, so it may be useful 
to try and cancel them. The handle is a closed-string tadpole with a closed-string 
loop (torus) at the end. Since the propagator connecting a tadpole to the rest of 
the graph is at zero momentum (by momentum conservation), the divergence of this 
graph reduces to essentially a counting of states in the loop. In the superstring, the 
bosonic and fermionic contributions running around this loop cancel. On the other 
hand, the 2 remaining types of tadpoles turn out not to be finite by themselves. How- 


ever, their divergences can cancel each other, for either the string or superstring, if 
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the gauge group of the open string is SO(2?/?). For one such insertion into the sphere, 
this is cancellation between the disk and projective plane. For 2 such insertions, it is 


between the annulus (cylinder), Klein bottle, and Mobius strip. 


+O 


projective disk 
plane 


+O) )- ° 


Klein + Mobius + annulus 


Il 
fo) 


Loops in open-string graphs can have half-twists in them. Such graphs are ori- 
entable if the number of half-twists in a loop is even. At 1 loop, such twisting is the 
same as putting some external lines on the inner boundary of a planar loop and some 
on the outside. On a loop with no strings attached to one boundary, that boundary 
is just a hole, a closed string extending into the vacuum. (An annulus is topologically 
the same as a cylinder.) But a loop with open strings attached to both boundaries 
is the same as a tree graph with a closed string attaching the two sets of states. If 
one calculates such a graph in open-string theory, no divergences are found, except 


for the poles of these closed-string states. 


Exercise XIA2.1 
An exercise in pictures for a subsection on pictures: Draw a 2-loop open 
string diagram that looks planar when 2 external open-string states are drawn 
coming from each of the 3 boundaries, when the external states are drawn 
inward for inner boundaries and outward for outer (as for the 1-loop diagram 
above). Show this is equivalent to a tree graph with a 3-closed-string vertex. 


Generalize to an arbitrary number of loops. 
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3. Classical mechanics 


We now consider string theory as derived by first quantization. As for particles, 
the first step is to study the classical mechanics, which determines the appropriate 
set of variables, the kinetic term of the field theoretic action, some properties of 
the interactions, and some techniques useful for perturbation. Just as the simplest 
such action for the particle produces only the relatively uninteresting case of the 
scalar, the most obvious action for the string yields a model that is not only too 
simple, but quantum mechanically consistent only in 26 dimensions. However, this 
toy model exhibits many relevant qualitative features, such as Regge behavior and 
duality. Later we'll consider the source of its problems by relating to four-dimensional 


particle theories. 


The simplest classical mechanics action for the string is a direct generalization of 


that for the massless scalar particle: For the Lagrangian form of this action we write 
1 [do on ; 
Sp = al ii On ¥ —99 5(OmX )(OnX°) Nas 


where X%(o0") is the position in spacetime of a point at worldsheet coordinates 0” = 
(0°, 01) = (7,0), g”"(o™) is the (inverse) worldsheet metric, and a’ is a normalization 
constant related to the string tension. It can also be associated with the flat-space 
spacetime metric 7»; if we couple a spacetime metric, then its vacuum value can 
be taken as 7»/a’, where a’ is the gravitational coupling, as discussed in subsection 
IXA5. If we vary this action with respect to X, we get its 2D wave equation, covariant 


with respect to the curved worldsheet: 


Ox*= Fa nV=G0"" 0X" =0 

A new feature of this action (compared to the particle’s) is that it is (2D) Weyl 
scale invariant (see subsection IXA7). This gauge invariance can be used to gauge 
away one component of the metric, in addition to the two that can be gauged away 
using 2D general coordinate invariance. The net result is that the worldsheet metric 
can be completely gauged away (except for some bits at boundaries), just as for 
the particle. However, this same invariance prevents the addition of a worldsheet 
cosmological term: In the particle case, such a term was needed to introduce mass. 
Here, mass is introduced through the coefficient 1/a’ of the (0X)? term: The same 
scale invariance that prevents use of a cosmological term also prevents this coefficient 


from being absorbed into the definition of the worldsheet metric. 


734 XI. STRINGS 


Just as for the particle, the metric can be eliminated by its equation of motion, 
resulting in a more geometrical, but less useful, form of the action: In this case the 


equation of motion (“Virasoro constraints” ) 
after taking the determinant of both sides, gives 
1 [do 
S=— | —vV-g Dine = (OmxX ) (One 
at fap VG Grn = (OmX) » (OnX) 


This is the area of the string in terms of the “induced” (intrinsic) metric Gmn, analo- 
gously to the particle case. The induced metric measures length as usually measured 
in spacetime: 


do™ do" Gra = (do OnX ) * (do"0,,X) = (dX)? 


Equivalently, this action can be written in terms of the area element dX*% A dX°: 


1 
=a / \/—-4(dX*AdX*)?,  dX* AdX? = (do°O)X") (dod, X")) 


For purposes of quantization, it’s also useful to have the Hamiltonian form of 


the action. This also allows us to see how the Virasoro constraints generalize the 
Klein-Gordon equation, and then find the BRST operator. By the usual methods of 


converting from Lagrangian to Hamiltonian, we find 


Po = 
Su = [| ZX P+, H = Sho! P? + ao" 1x) + MX’. p 
2n 911 gu 


where © = 0) and’ = O,. Various combinations of components of the worldsheet 


metric now appear explicitly as Lagrange multipliers. If we define 


A A 


Pay =Fla?P ta ?X') = (Puy, Po] =0 


the constraints can be written as two independent sets aa 


Exercise XIA3.1 


Show that if we call g, the Lagrange multipliers for Pixs then in convenient 


VS 


local Lorentz and Weyl scale (but not coordinate) gauges we can write in a 


lightcone basis 


i =e."0,= FP + g-0;) 


while in another scale gauge we can write 


ge = Lge gs cede") 


ns 
v2 
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Exercise XIA3.2 
Find the (equal-r) commutation relations [P, \»P )|. Show that the (semi- 


A 


classical) commutation relations of the constraints Pe) close. (Hint: Use the 


identity 
f(a)o'(a — b) = f(b)d'(a— 6) — f'(b)d(a—6) . ) 


Since 2D general coordinate (and even just Lorentz) invariance is no longer man- 
ifest in the Hamiltonian formalism, for some purposes we need to generalize this to a 


form that is first-order with respect to both 7 and o derivatives: 


1 faa ; 
Se. ee, _ pm _y7\—1/2 lpm, pn 
81 = f Fn X)-P™ + (-9) om P™ PP 


obviously reproduces S; after eliminating P™ by its equation of motion 
Pi! = —/=Gg"nX 
Eliminating just P! gives a simpler way of deriving Sy (with P® = a’P). 

Since open strings have boundaries, the action implies boundary conditions, orig- 
inating from integration by parts when deriving the field equations. In the last form 
of the action variation of the first term gives, in addition to the { d?o terms (6P)-0X 
and —(0X)- OP for the field equations, a boundary term ¢ do™€mn(dX) +P", where 


do™ is a line integral along the boundary, and the €, picks the component of P™ 


normal to the boundary. We thus have 
NmP™ = 0 at boundaries 


where n,, is a vector normal to the boundary. This condition on the derivative of 
X (“Neumann” boundary condition) causes waves propagating in the string to be 


reflected at the boundaries. 


A simple interpretation of this boundary condition is to consider an open string as 
a closed string “folded over” on itself: At any fixed 7, following X(c) for increasing 
ao takes one along the usual open string, but then doubles back at a boundary to 
backtrack along the same path, and the same at the opposite boundary, becoming 
periodic as for the closed string. This periodicity is convenient for -Fourier expanding 
in exponentials, rather than sines and cosines. Continuity in X upon reversal at the 
boundaries implies the Neumann boundary condition, but implemented in the usual 


way for 2D problems, by the method of images, due to this doubling. 


From the constraint imposed by varying gmn, it then follows that 


(tmP™)? = 0 at boundaries 
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where t,,, is a vector tangent to the boundary (or any vector, for that matter). Since 
by the field equations P™ ~ g'"0,X, this means that the boundary is lightlike in 
spacetime: The ends of the string travel at the speed of light. 


4. Types 


There are various types of known string theories: Some are supersymmetric and 
some are not. (The supersymmetric ones appear to be equivalent to each other 
nonperturbatively, and equivalent to membranes, but here we restrict ourselves to 


perturbation theory.) 


Besides supersymmetry, there are geometric distinctions. One of these is be- 
tween open and closed strings: Closed strings have modes that are either left- or 
right-handed, i.e., propagating in either o-direction. For open strings these modes 
are identified, since left-handed modes become right-handed upon reflection at the 
boundary (and vice versa). For closed strings they are independent, and may have 


different supersymmetry properties. 


Since the open strings have ends, we can associate internal symmetry indices 
(“Chan-Paton factors”) with them, as found in subsection VIC4, following from the 
same in subsection VC9 for ordinary particle field theory. These indices can also 
be associated with worldsheet variables that live only on string boundaries. As in 
the field theory case, these indices are associated with orientation of the boundaries 


(arrows) only for U groups, not for SO or USp. 


As we'll see in section XIB, quantization of known open strings always produces 
Yang-Mills at the massless level (super Yang-Mills for open superstrings). Since closed 
strings have effectively two sets (left and right) of open string modes, the closed- 
string Hilbert space is effectively the direct product of two (perhaps different) open- 
string Hilbert spaces (with an added restriction implied by o-translation invariance, 
to be discussed later). In particular, at the massless level this direct product of two 
vectors can give a graviton (symmetric, traceless tensor), scalar (trace), and axion 
(antisymmetric). In the case that the two open strings are the same, it is possible to 
restrict this direct product to its symmetric part. This eliminates the axion, but not 


the scalar. Thus, a massless scalar appears even in the simplest case. 


Another geometric property we discussed topologically was orientability of the 
worldsheet. To understand orientability, we examine the discrete symmetries of the 
worldsheet. Wave functions or fields describing the string can be expressed as func- 


tionals of X at fixed 7 (just as for particles). We also choose o to run from 0 to z for 
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the open string, and from 0 to 27 (periodic) for the closed. (This choice as a gauge 
condition will be discussed in more detail in subsection XIB1.) As in D=4, local, 
unitary, Poincaré invariant 2D field theories are always CPT invariant. In particular, 
CPT doesn’t switch left- and right-handed modes (the “velocity” do/dr is invariant), 
which differ in some string theories. Thus, we can always impose invariance of the 
wave function/field under the worldsheet CPT transformation: As the generalization 


of the particle condition 4(x) = ¢'(x), we have for the string 
worldsheet CPT: &[X(o)| = &'[X (a — o)] 


where parity transforms 0 — a —o to preserve ge|0,7] for the open string; for 
the closed string the a is irrelevant because of periodicity and invariance under o 
translation. (We have written only the X coordinate explicitly for simplicity; similar 
remarks apply to other coordinates, such as ghosts, with possible extra signs due 
to 2D Lorentz indices.) Hermitian conjugation for the open string (for the closed 
string the field is not a matrix), instead of just complex conjugation (for C), simply 
switches the internal symmetry factors associated with the left and right ends of the 
open string (matrix transposition), as also required by parity. In particular, this 
implies that the matrices associated with the Yang-Mills fields are hermitian, so the 
Yang-Mills group is unitary. 

In addition to this reality condition, if the 2D theory is also invariant under CP 
and T, it is also possible, though not necessarily required, to impose such a quantum 
mechanical invariance under CP, and thus T: As a generalization of the particle’s T 
condition (see subsection IA5), 6(2) = M¢*(x)M~! © ¢(x) = Md? (x) Mt, for the 
string 

worldsheet T : &[X(o)| = MG*[X(c)|M1 e 


worldsheet CP: 6[X(c)| = MG" [X(n — 0) |M 


As for Yang-Mills in the particle case, for the open string the matrix “M” is the 
group metric (we drop the M and the 7 for the closed string, which is not a matrix), 
either symmetric or antisymmetric depending on whether the group is orthogonal 
or symplectic; without imposing T and CP the group is just unitary. Thus, all the 
classical groups are allowed (at least in the classical field theory; exceptional groups 


cannot be described by associating indices with the ends). 


Since imposing invariance of the states (not just the action) under CP and T 
makes it impossible to observe the left/right handedness of the worldsheet, such 


strings are “unoriented”, as opposed to the “oriented” strings that satisfy just the 
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CPT condition. Thus, orientability of the surface is directly related to orientability 


of its boundaries (oriented for U, unoriented for SO or USp). 


Also, as in the particle field theory, unorientability allows “twisted” worldsheets 
that are prohibited in the oriented case (because we can distinguish the “front” of the 
worldsheet from the “back” ): This allows such exotic geometries as Mobius strips and 
Klein bottles. Open strings produce closed ones as bound states (open and closed 
strings are parts of the same worldsheet with different boundaries); in theories of 
open and closed strings, they must be both oriented or both unoriented. Since the 
worldsheet-CP and -T switch lefty and righty modes, this invariance on the closed 
string results in the restriction introduced earlier, keeping only the symmetric part 


of the direct product. 
Exercise XIA4.1 


What is the difference between an unoriented closed string (satisfying this 
worldsheet CP condition) and the interpretation of subsection XIA3 of the 


open string as a closed string folded over on itself? 


The known string models all have massless particles. A string model with massless 
particles can be applied to hadrons only if masses are given to all these states through 
the Higgs mechanism or some other change in the vacuum. An alternative is to use 
such a model to describe fundamental massless particles (graviton, photon, gluons, 
neutrinos), although this would also require the usual Higgs of the Standard Model 
for generating masses for some particles (W, Z, quarks, charged leptons, Higgs). In 
particular, all known string models have a graviton, and there is no known method 
whereby this graviton would gain mass, so these models seem suited only for unified 
theories of gravity plus matter. For this purpose, the massive fields have little phe- 
nomenological interest. They might improve high-energy behavior, but only near the 
Planck scale, which is effectively unobservable. Therefore, it is necessary to analyze 
the massless subsector of such string theories to find signs of fundamental strings in 


nature. 


The massless sector of the open string includes spin 1 and no higher. This is 
true for the known string models, and also is expected to be a general result, since 
otherwise the closed string would include massless states with spin higher than 2, for 
which no consistent interacting theory is known. Spin 1/2 leads to supersymmetry, 
as described below; we first consider bosonic strings. For convenience (and ultimate 
utility) we consider 4D states; in presently known strings these are the massless states 


in perturbation about the compactified vacuum. 
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The bosonic string contains at least the graviton, a scalar (usually going by the 
misnomer of “dilaton” ), and a pseudoscalar (the “axion”, described by an antisym- 
metric-tensor gauge field), and can contain additional vectors and scalars (if the 
open strings had scalars in addition to the vector). This analysis can be performed 
covariantly, but it is simpler to use a helicity or lightcone analysis. Then the helicities 
of the closed-string states are just the sums of those of the open strings: For the 


product of two vectors (the minimal case), 
(+16 -1) @ (416-1) =+200000-2 


giving the graviton, scalar, and axion. Similarly, additional scalars for one open string 
give additional vectors for the closed, while additional scalars for both open strings 


give also additional scalars. 


Exercise XIA4.2 
Make the same analysis in terms of covariant fields, both for the fields them- 
selves and their gauge transformations. Note that the trace of the gravita- 
tional field h,» (determinant of the metric gmn) is missing. (It’s unphysical, 


and can be found from the ghost sector, as explained in chapter XII.) 


Considering the massless spectrum of superstrings, we now look at the restric- 
tions imposed by supersymmetry whenever fermions are included. The open string 
can also contain massless spin 1/2, but only if it is related by supersymmetry to its 
massless spin 1, since it leads to spin 3/2 in the closed string, and massless spin 3/2 
is known to be inconsistent in an interacting theory unless related by supersymmetry 
to the graviton. (Spin 3/2 gauges supersymmetry. But spin 1 can’t couple minimally 
to spin 3/2: see exercise XIIB7.2b below. So, spin 3/2 needs spin 2 as its supersym- 
metric partner.) Thus there are two possibilities for the massless sector of each open 
string: (1) vectors and scalars for an open bosonic string, or (2) vector multiplets 
(vectors, spinors, and scalars, all related by some number of supersymmetries) for 
an open superstring. From our analysis of subsection I[C5, there are furthermore 3 
types of vector multiplets in D=4, corresponding to N=1,2, or 4 supersymmetries. 
(These result from compactification from N=1 in D=10, depending upon how much 


supersymmetry is broken.) 
This leads to four types of closed strings: 


(1) The bosonic string, from bosonic ® bosonic was discussed above (actually 2 types, 


if we distinguish oriented and unoriented). 


(2) The “heterotic” string comes from bosonic ® super. It thus can have N=1,2, or 
4 supersymmetries. 


740 XI. STRINGS 


(3) The “(Type II) superstring” comes from super ® super. The total number of its 
supersymmetries is the sum of those from the open strings: Depending on the type 
of supersymmetric open strings used, the superstring can have N=2,3,4,5,6, or 8 
(in other words, anything greater than 1, since N=7 supersymmetry is equivalent 


to N=8, and 8 is the maximum for supergravity). 


(4) In the super case, if the left and right open strings are the same, we can impose 
symmetry as in the bosonic case (“Type I”). Then we may also include the open 
strings in the spectrum: This symmetrization also identifies the left and right 
supersymmetries, so then N=1,2 or 4, the same for open and closed states (so 
they can be consistently coupled). 

The spectrum again can be analyzed by helicity: For example, for the N=1 het- 


erotic string, we have 
(1650-5; 0-1) ® (10-1) = (2020-2 © -2) ®(§ €6080@ -4) 
which is supergravity plus a scalar multiplet. As for the bosonic string, all supersym- 


metric closed strings include the scalar, again coming from vector ® vector. 


Exercise XIA4.3 


Make the same analysis for some other supersymmetric cases: 


a N=2 and 4 heterotic. Also make a simpler analysis using “superhelicity” , 


writing any supermultiplet as the lowest-helicity one ® some helicity. 
b N=2 Type II. 
c N=1 Type I. 


5. T-duality 


Another symmetry of all known string models is “T-duality”. It is closely re- 
lated to the open ® open structure of closed string states, and thus expected to be 
a general property of string theory. We consider the simple bosonic model as an 
example. Including constant background fields, working with flat worldsheet metric 


(the “conformal gauge”: see subsection XIB1) for convenience, the Lagrangian is 


where +,— are lightcone worldsheet indices, the curved indices m,n now refer to 
spacetime, Gn, is the spacetime metric, and B,,, is an antisymmetric tensor gauge 


field (“axion”). Writing the action in first-order form 


Li = =P,,,0_X™ — P09 X" + PigP_,M™ 
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where M"™" is the inverse of M,,,,, we vary X instead of P to solve the field equation 
O,Pm+O_Pim=0 => Pim=Xm, Pm =—O_Xn 
and substitute to find the “dual” Lagrangian 


I! = (0. Xm)\(O-X,)M™ 


Thus the “duality transformation” from X to X is an invariance of the theory, 


as long as we also transform the background: 
RX Mase 
Note that in flat space (M™" = 7'""), using the P equation of motion in L’, we have 
P pp = Wak “3 Pa =o 


so duality just changes the sign of the right-handed modes (O_X = 9% ) while 
leaving invariant the left-handed ones (0,X = 0,X). (The treatment of the “zero- 
modes”, those killed by the derivatives acting on X or X, is more tricky: We have 
ignored them by taking the background constant.) We can see this to lowest order 
in the background, since M — M7, to lowest order in perturbation about (M) = n, 
changes the sign of the field, corresponding to the fact that their “vertex operators” 
(coefficients of linearized background fields) are linear in both left- and right-handed 
modes ((0,X)(O_X)). However, in full nonlinearity, duality mixes the spacetime 
metric Ginn(X) and axion Bny(X). 


This invariance can be generalized to a continuous O(D,D) symmetry by com- 
bining it with (global) Lorentz transformations. The above discrete symmetry is a 
kind of “parity” for this larger group: There are also “reflections” from performing 
the duality on just one component of X™. The easiest way to see the full symmetry 
is in the Hamiltonian formalism, where it can be made manifest: We first combine 


X'” and the canonical momentum P,, into an O(D,D) vector: 


Ji (Paya) 


0: 08 
=> [Zu(1),Zn(2)] =—20i6'(2—1)nmn, ON = ire ) 
where the O(D,D) metric 7,7 is constant even in curved space. (We have abbreviated 


“1” for “o,”, etc.) The Virasoro constraints are then 


an’ ZuZn = MN Zu Zn =0 
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where M is not only symmetric but also an element of the O(D,D) group: 


Cia GC” Bon 


M™MN a 
—BrpG?” Ginn — BrpG?*Ban 


) = MNM — pMP(-) pon? 

If the fields are constant in only d of the D dimensions, than the symmetry is re- 
duced to O(d,d); thus O(d,d) is a symmetry of the dimensionally reduced theory 
with arbitrary fields. 


Exercise XIA5.1 
Show that the conditions on M can be solved in a manifestly O(D,D) covariant 


way by use of a “vielbein” E4™: 
M=M*, MnM=n (n= =1") 


ab 0 
=> M=E'HE, EnE’=n, f= @ w) 


Show that M is invariant under a local O(D—1,1)@O(D—1,1) transformation 
on FE, so FE is an element of the coset space O(D,D)/O(D—1,1)@O(D-—1,1) 
(see subsection IVA3). 


6. Dilaton 


We can extend the spectrum analysis of subsection XIA4 off shell: The procedure 
(to be justified in chapter XII) includes the ghost and antighost (multiplets) for the 
vector (multiplet) as a doublet of the ghostly Sp(2) symmetry. The direct product 
of vector ® vector now clearly gives a traceless symmetric tensor (graviton), the 
corresponding trace (physical scalar), and an antisymmetric tensor (axion). In the 
direct product of the ghosts, the Sp(2) singlet gives the trace part of the metric tensor, 
which is the true dilaton. This dilaton (the determinant of the metric tensor in the 
nonlinear case) is required in gravity for constructing local actions (see subsection 
IXA7), but does not contain a physical degree of freedom. The physical polarizations 
of the graviton are contained in the traceless (actually det = —1) part of the metric, 
which describes the conformal part of gravity. The direct products involving ghosts 
also give Sp(2) nonsinglets, which are the ghosts of the massless sector of the closed 
string. BRST transformations (and thus gauge transformations) can also be obtained 


by this direct-product procedure. 


The natural coupling of background fields in the classical mechanics of the string 
reflects this direct-product structure, as seen in subsection XIA5. This means that 


the background metric as we have defined it has as its determinant not the usual 
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one, but that times a power of the physical scalar: It is a physical degree of freedom. 


T-duality mixes physical degrees of freedom with each other. 


If we try to construct a low-energy action for the massless fields of the bosonic 
string, it is not too difficult to find a scalar invariant under T-duality to act as the 
Lagrangian. However, it is impossible to use the usual measure { dx ,/—g because g 
is not invariant under T-duality. This problem is solved by including the spacetime 
dilaton field (X): It couples to the string as 
do 


Sea = — 
dil Dar 


—g 5r In &(X) 

where we denote the worldsheet curvature by r(a) (only in this subsection) to distin- 
guish it from the spacetime curvature R(x). (There are also boundary contributions: 
see exercise IXA7.3.) This term can also be expressed as a coupling to the world- 
sheet ghosts (according to the above arguments), allowing the worldsheet metric to 


be completely fixed by gauge transformations, as usual. 


Since there is no X dependence of Sq; for constant dilaton field (no 0X factors, 
unlike G and B), the constant dilaton is invariant under T-duality. Furthermore, since 
it couples to the worldsheet curvature, which counts the number of loops, the dilaton 
must appear homogeneously in the classical action. The dilaton that appears as above 
in the string action transforms as a density under general coordinate transformations, 
allowing the construction of actions invariant under both T-duality and coordinate 
transformations. The resulting spacetime action for the massless fields of the oriented, 


closed bosonic sting is 
Smaasless = fu &(O0 = +R + HH Hate -++ A) 


where Hyp. = SV [a Boe} is the field strength for the axion. T-duality determines the 
only arbitrary coefficient, the relative weight of the LJ and R terms. Note the absence 


1 


of the factor e~*, which has been absorbed into the definition of ®: The covariant 


derivative acting on &, since it is a density that transforms as e~!/?, acts as 
Ve = ee e726 


(We have included a cosmological term, allowed by T-duality, but not appearing at 


tree level.) 


Exercise XIA6.1 
Find the field equations following from this action. Then make the field 
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redefinition = e~'/e%, to find the result: 


) 
— = (Vo)? +O¢-4R+ HH? +A=0 
) 
; => VA + 2HarcV "oh = 
) 
be,m = Figg = 2VaVoP a 5a Abed 


Both coupling constants in string theory can be associated with vacuum values: 
(1) The string coupling appears as the vacuum value of the dilaton, since it counts 
loops. (2) a’ comes from the vacuum value of the (spacetime) metric, as can be seen 
from the worldsheet action. This is the string-gauge equivalent of the fact that the 
gravitational constant naturally arises as the vacuum (or asymptotic) value of the 
metric in ordinary gravity (see subsection IXA5). In string theory, the fact that the 
gravitational constant is a combination of a’ and the string coupling is equivalent to 
the field redefinition from the string gauge to the particular Weyl gauge where the 
Einstein term in the action appears in the usual way. 
Exercise XIA6.2 
This action is in the string gauge (see subsection IXB5). 
a Make the physical scalar explicit in the action by the field redefinition (Wey] 


scaling: see subsection [XA7) 


e i=% Xeq”” 


leaving @ and B,,, unchanged. 

b The resulting scalar action can be (off-)diagonalized by further redefinitions: 
Noting that the known string theories are defined for /D — 1 an (odd) integer 
(5 or 3), write the dimension in general as (for any D > 1, n not necessarily 
integer ) 


Der +1 


Restoring the e~! to the action, redefine 


= r) 


oo eo V2g@ 1)/2(n4 1) gin +1)/2(n—1) y= gig ey 


which also gives the scalars ¢4 the canonical Weyl scale weights, to obtain 
the final result for the Lagrangian L (where S = [ dx e~'L) 


b=¢( 2-429. 4200 5)/(n 1) 772 


+Ag® Lier] ater lite Ly 
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c This redefinition is singular for D = 2 (n = 1). Fix this by making the 


additional redefinition 


db —* pete 


and then taking the limit n — 1. (We can also use redefinitions equivalent in 
the limit, such as ¢_ > get » Show the result is then 


L— +6,(O ln ¢_ — R) + 403.¢7H? + Ad_ 


We can no longer choose the gauge ¢; = 1, since it is now scale invariant, 


but we can still choose ¢_ = 1. 


When applied to the supersymmetric cases (superstring or heterotic string), the 
inclusion of ghosts in the direct-product procedure also gives the auxiliary fields. 
(The dilaton itself is an auxiliary field.) For example, in the N=1 heterotic case, the 
direct product of the physical parts of the vector and vector multiplet give conformal 
supergravity (the supersymmetrization of the traceless part of the metric) and a 
physical tensor multiplet (the supersymmetrization of the axion and scalar). On the 
other hand, the ghosts of the vector multiplet form a chiral scalar superfield; its 
procuct with the scalar ghost of the vector gives another chiral scalar superfield, the 


compensator, containing the dilaton. (See subsection XA3.) 


The two conditions of supersymmetry and that the dilaton must appear homoge- 
neously (quadratically after an appropriate field redefinition) are now enough to fix 
the form of the action (except for the nonminimal heterotic cases, where the open 
string’s scalars introduce extra vector multiplets). For convenience we redefine the 
chiral scalar compensator as ¢ — ¢/3 so that it appears quadratically in the cos- 
mological term [ d*x d?@ ¢?. Thus, by dimensional analysis ¢ now has scale weight 
3. The axial-vector field strength of the axion appears as [Va, Va]G, so G has scale 
weight 2. (Gauge fields are Wey] scale invariant with curved indices for consistency 
with gauge transformations; thus Hinrnp has weight 0 while H,», has weight 3.) Of 
course, these weights also follow from local superscale transformations, the global part 
of which transforms fields as L?” (see subsection XA4). The only action quadratic 
in the dilaton consistent with global scale and U(1) (R) invariance is then (with 
implicit covariantization with respect to conformal supergravity, which makes these 


invariances local) 
S= / dx d*0 ¢¢G V7 + (4 / dx a6 ¢? + he.) 


Exercise XIA6.3 
Use the methods of subsection XB6 to find all of the terms in this action 
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involving only bosonic fields. Compare to the bosonic string action considered 


above. 


Besides T-duality, string theories also have “S-duality” symmetries that are re- 
alized only on the field equations, or after performing electromagnetic-type duality 
transformations on the fields: If we convert G into a second, physical chiral multiplet 


xy by such a duality as described in subsection XB5, the above action is converted to 
S= pu d*6 (66/7? (x + x) + (4 fac a6? + he.) 


After the redefinitions 
vod ¢x>x 


the first term becomes manifestly SU(1,1) invariant (see subsection XB7): 


s= ja d‘0 (dx + xo)? + (4 [ac a0 bth) 


It is now the original T-duality that can be realized only on shell. Also, in this form 
the condition that the dilaton should appear homogeneously is obscured. (Such S- 
dualities were first seen in extended supergravity theories, especially when obtained by 


reduction from higher dimensions, where antisymmetric tensors are often required.) 


Exercise XIA6.4 
Apply the results of exercise XB5.1 to include vector multiplets in the above 
actions by replacing G — G in the first action and performing duality trans- 
formations. (The super Yang-Mills appears in the spectrum from the product 
(vector @ scalars) ® vector multiplet in the heterotic string.) This substi- 
tution is dictated by homogeneity in the dilaton, which prevents the usual 
conformal [ d?6 W? term. Such terms occur naturally in higher-dimensional 


couplings of supergravity to super Yang-Mills. 


Classical and quantum symmetries of mechanics formulations of particle and 
string theories in background fields are often used to derive equations for those back- 
grounds. These features are not peculiar to these theories or their formulations: They 
are a general feature of describing a particle/field of some (super)spin in a gauge back- 
ground. These equations fall into two distinct types: (1) A supersymmetric system 
in a gauge background of higher superspin generates constraints on the background, 
necessary for consistently defining the coupling (see subsections [VC4 and XA1). 

(2) Any gauge system in a background of the same gauge field generates field equa- 
tions for the background (see exercise VIB8.2). 
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For example, the classical symmetries of the superparticle always generate con- 
straints on its background, but give field equations for it only if the number of super- 
symmetries is enough to insure its superspin is as high as that of its background (e.g., 
10D N=1 in background super Yang-Mills or 11D N=1 in background supergravity). 
Similarly, the bosonic string generates field equations for background gravity at the 
quantum mechanical level because quantization is required to reveal the massless 
graviton excited state contained in the string itself. On the other hand, the 10D 
superstring already generates field equations for background supergravity classically, 
since the ground state of the superstring (closed if boundary conditions are ignored), 


the only part that is evident (semi)classically, already contains supergravity. 


7. Lattices 


In string theory there are two spaces, the two-dimensional space of the worldsheet, 
and physical spacetime. In subsection VIIIB7 we considered approximating spacetime 
by a lattice; in this subsection we instead approximate the worldsheet by lattices. For 
the spacetime of QCD we used a regular lattice, representing the fixed geometry of 
flat spacetime. In string theory we considered worldsheets of arbitrary geometry, 
described by a worldsheet metric, so our lattices should be more arbitrary; in fact, 
functional integration over the worldsheet metric must be replaced by summation over 
different lattices. We saw that the topological expansion of QCD in 1/N generated 
polyhedra analogous to the worldsheet, with 1/N acting as the string coupling. We 
therefore identify the Feynman diagrams themselves, with faces chosen by the 1/N 
expansion, as these lattices, to give a more precise correlation between the second- 
quantized path integral of QCD (and other field theories) and the first-quantized path 


integral of string theory. 


Presently the relation between such field theories and string theory is not well 
understood, and has been described only for the bosonic string. Since the bosonic 
string has only the worldsheet metric and spacetime coordinates as degrees of free- 
dom, it corresponds to a (NxN-matrix) scalar field theory. Since a lattice requires a 
scale, while conformal invariance includes scale invariance, we must break the confor- 
mal invariance of the worldsheet. The simplest coordinate-invariant yet scale-variant 
property of a space is its volume, so we add a volume (area) term to the string action. 
Furthermore, to describe interactions we need to include a term containing the string 
coupling constant; in string theory the power of the coupling constant is counted by 


the integral of the worldsheet curvature. Our worldsheet action thus consists of the 
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three terms 
Po J mn 1 
a av 8 aie 5(OmX) + (OnX) ++ (In K)5R 


A lattice version of this action is (with x given in subsection VIIC4) 


1 
S,= ar dy 3X5 — Xe)? + D1 + (in nr) {$01-S$014+501 
(jk) j j (ik) J 
where j are vertices of the lattice, (jk) are the links, and J are the “plaquets” (faces, 
loops). 
Exercise XIA7.1 
Put the particle on a random lattice “Minkowski” worldline. (See exercise 
VB1.2.) Show the propagator for a massless particle, written in momentum 
space, before taking the limit lattice spacing € — 0, is 


2€ 
1H ete? 


Show this has unphysical poles at p? = 27n/e for arbitrary integer n. How 


do these results differ if the propagator is defined for Wick-rotated 7? 


The corresponding field theory is easily found, according to our earlier discussions, 
by (1) identifying the worldsheet lattice with a position-space Feynman diagram (the 
vertices of the lattice being those of the diagram, the links of the lattice being the 
Feynman propagators; see subsection VC8), and (2) using the 1/N expansion to 
associate the faces of the worldsheet polyhedra with the U(N) indices of the scalar 
field (see subsection VIIC4). 


We then can immediately identify the three terms in the string action with their 


counterparts in the scalar field theory: 
(1) The X term gives the propagators, 


(2) the area term (which counts the vertices) gives the vertex factor (coupling con- 


stant), and 


(3) the curvature term (which classifies the topology) gives the 1/N factors of the 
topological expansion. 


Thus, the three constants in the string action can be identified with the mass, cou- 
pling, and number of colors of the scalar field theory. Explicitly, the field theory 


action is 


dex 1, ,-a’'O/2 lin 
So = Ntr Graybr 2 ¢— G=¢") 


Iman DP 
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where we have identified 


— pl — 2_ 2 
G=e 5 Wf, m =a 


and we have put an overall factor of N (associated with 1/h) so that G (and m?) is 
fixed (rather than G times some power of N) when the 1/N expansion is performed. 


(The reverse can be made true by rescaling ¢.) The unusual kinetic operator 
e 20? — Lim? -O+...) 


comes from identifying the second-quantized particle propagator as it appears in the 


first-quantized path integral for the string: 


D 
X; / 
a= [Tenne® = Aer 


(27a) (QnayPP © 


Unlike the spacetime lattice, the worldsheet lattice preserves spacetime Poincaré 
symmetry, so it’s unnecessary to take any limits to define a physical theory (or at least 
taking limits won’t improve the physical relevance of this model). This model thus 
describes a stiff or lumpy string. The usual continuum-worldsheet string then can 
be identified with a particular limit of this more general string. Explicit calculations 
have demonstrated that this lattice regularization of the worldsheet reproduces the 
results of the continuum approach. These results have been limited to spacetime 
dimension < 1 because of the inconsistencies introduced by the tachyon, which is 
the ground state in higher dimensions. Unfortunately, this prevents study of the 
more interesting properties, such as scattering amplitudes and the precise form of 
the potential (we have left n arbitrary in 5S»), since it’s superrenormalizable in D<2 
regardless of its form. However, these limitations probably would not appear in a 


corresponding formulation of the superstring, which has no tachyons. 


An interesting feature of this model is the use of Gaussian propagators to get 
rid of the usual perturbative divergences of momentum integration. Naively, one 
might suspect that such field theories were completely finite. However, we know in 
this case that the bosonic string does have divergences perturbatively in the string 
coupling, and that there are further problems unless D=26. This demonstrates that 
modifying a theory to fix problems seen in perturbation theory does not preclude the 


reappearance of such difficulties nonperturbatively. 
These Gaussian propagators lead to Gaussian behavior of fixed-angle scattering 


(as we will see in subsection XIB6), in conflict with hadronic physics, where power-law 


behavior is observed for partons with large transverse momenta, and is a theoretical 
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consequence of asymptotic freedom with the usual propagators. (In fact, it is the 
main empirical verification of QCD.) 


—x?/t 


Since nonrelativistic first-quantization gives Gaussian propagators e , 1t is not 


surprising that the simplest strings should result in partons with Gaussian propaga- 
tors e~”. However, the fact that first-quantization for particles leads instead to, 
e.g., 1/x? propagators for massless particles in 4D position space suggests that an 
analogous treatment for strings should be possible. We thus attempt to follow the 
derivation above from parton to string, but starting with realistic parton propaga- 
tors. The first step is to exponentiate the propagator so that the exponent can be 
identified with a first-quantized action. The easiest way, and that most analogous to 
the nonrelativistic case, is to use the Schwinger parametrization of the propagator, 


which follows from the appearance of the worldline metric in the action: 


As we saw in subsection VC8, a Feynman diagram in a scalar field theory with 


nonderivative self-interactions is then written as 
[ eetdnssars eo Daag MtsPiy/2-Hes—a5) Pd 
a 


In the (worldsheet) continuum limit of this expression, p becomes a worldsheet vector, 
so T must become a symmetric worldsheet tensor. Since on a regular square lattice 
(“flat” worldsheet) there are two propagators per vertex (for the two independent 
directions), 7 must be a traceless tensor. (This also explains why 7 can’t be just a 
scalar.) Imposing this tracelessness through a Lagrange multiplier A, we can write 


the (Wick rotated) continuum action as 


da 
= / =~ {-iP™ -OmX + 4tmn(P™ P+ Agg™") + V/—glu t (In «) 4 R]} 


Thus 7 acts as a kind of second worldsheet metric. However, since Schwinger param- 
eters are positive, Tn, must be positive definite, and thus a Euclidean metric. This 
also implies that gm, must be Minkowskian, to be consistent with the tracelessness 
condition. Note that if we set \ equal to a constant, and ignore the positivity condi- 
tion on T, then eliminating 7 by the equation of motion from varying gmn reproduces 
the usual string action, where we can identify a’ = y/(A). This indicates a possible 


approximation scheme. 


The two components of 7 that survive this tracelessness condition correspond to 


the two lightlike directions defined by gmn: If we use a “zweibein”, defined as usual by 
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Imn = —€(m* en) , to flatten the indices on 7, then the Lagrange multiplier constraint 


can be solved by simply setting 7, = 0. The action is then 


2 
o=  maliP C4" OmX + 5T44P_ + Pe + pt (In &)5R| 


Back on the lattice, this implies that the directions chosen by the propagators (links) 
on which P is defined are lightlike. Thus, the matrix model defined by this theory 
should have only 4-point vertices, with the four propagators coming from any vertex 
forming the worldsheet lightcone at that point on the worldsheet. The field theory 


action is thus eo 


dv x 1 124 
S= Nir fr a(-h006 — G40") 
For D = 4, this action describes an asymptotically free theory, “wrong-sign” ¢4 


theory. 


Unlike conventional strings, this “QCD string” has critical dimension D=4 for 
renormalizability. (In conventional strings all momentum integrals are Gaussian and 
thus converge.) Another reason for D=4 is T-duality: T-duality interchanges the 
positions of the vertices with the momenta of the loops. This is clear from our 
discussion of the classical mechanics of Feynman diagrams in subsection VC8, if we 
note that the procedure we used there to translate from coordinates to loop momenta 
is exactly the random lattice version of the T-duality transformation performed in 
subsection XIA5 (with X as the loop momenta). Thus, invariance of a string theory 
under T-duality must include invariance of the propagators of the underlying field 
theory under Fourier transformation. This is trivial for conventional strings, since 
the Fourier transform of a Gaussian is a Gaussian. However, by dimensional analysis 
(or explicit evaluation: see exercise VIIB4.2), we see that the Fourier transform of 1/p* 
is 1/x? only in D=4: T-duality implies both D=4 and masslessness. Furthermore, 
we can look at interactions by considering the simplest case: The flat worldsheet is 
represented by a regular, flat lattice. For ¢+ theory we have the usual square lattice, 
which is self-dual under switching vertices with loops (T-duality). On the other hand, 
triangular and hexagonal lattices, corresponding to ¢° and ¢° theory, are dual to each 
other (i.e., ¢” is dual to ¢?"/(~?), as follows from geometry). Thus T-duality also 


implies the ¢* interaction. 
Exercise XIA7.2 
Let’s examine T-duality for the random lattice more carefully: 


a Repeat the T-duality transformation of subsection XIA5, but for the QCD 
string (see subsection VC8), without a background (M™" = 7"). Show that 
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invariance under X” — X,,, requires that the matrix 7 also be replaced by 
its inverse, with some factors of the 2D ¢ tensor. (A also transforms; you can 


avoid this complication by using the zweibein form of the action.) 


Write the massless scalar propagator in momentum space of arbitrary dimen- 
sion D as an exponential using a Schwinger parameter 7. Show that after 
T-duality — Fourier transformation combined with 7 — 1/7 (which leaves 


the exponent invariant) — a 7-dependent “measure” factor is introduced 
? 


except for D=4. 
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We saw in subsections VIIB5 and VIIIA7 some unusual features of massless the- 
ories in D=2. Since the mechanics of the string is mathematically equivalent to 2D 
field theory (as the mechanics of the particle is to 1D field theory), we now examine 
such field theories in a little more detail. In particular, since the string we studied in 
section XIA possessed local Weyl] scale invariance on the worldsheet, we are directed 


to 2D conformal field theories coupled to 2D gravity. 


One confusion for beginners in string theory that unfortunately is supported by 
some of the terminology is the distinction between first- and second-quantization: The 
first quantization of the string is often described as “2D (conformal) field theory”, 
with the justification that they are allegedly the same mathematically. In the same 
spirit, one might also say that addition and multiplication are mathematically the 
same, but no mathematician would ever say that when applied to, e.g., the real 
numbers, even though they share some properties. For the same reasons, we must 


distinguish between first- and second-quantization of string theory. 


As we saw in subsection IIIA3, they use different perturbation expansions, corre- 
sponding to whether the f is put in front of the mechanics action of subsection XIA3 
or a corresponding “string field theory” action: (1) For the string, the expansion 
about classical mechanics is an expansion in a’. This is again a JWKB expansion, 
an expansion in powers of momenta, since a’ has dimensions of (mass)~*. (2) The 
expansion about classical field theory is as usual an expansion in the (string) coupling 


constant g. 


lgt-q: al = ha’ 


2nd-q: Sig 


In what follows we will often use the terminology “conformal field theory” to 
describe this situation, keeping in mind that as far as application to string theory is 
concerned a more appropriate term would be “conformal mechanics”. (True conformal 
field theory does appear in string theory when applied to the Anti-de Sitter /Conformal 
Field Theory correspondence, where the relevant 4D conformal field theory is maxi- 
mally supersymmetric Yang-Mills.) The mathematical methods of conformal string 
mechanics are also applied as true 2D conformal field theory in statistical mechanics, 
for the purpose of studying 2D systems, or as a toy model for better understanding 
4D conformal field theory. 
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1. Gauges 


We begin by considering gauge choices for the various forms of the bosonic string 
action presented in subsection XIA3. In direct analogy to the particle (subsection 
IIIB2), the two most useful gauges are the “conformal gauge”, defined by completely 
fixing the worldsheet metric, and the lightcone gauge, which is not manifestly globally 
covariant but is a complete fixing of the residual gauge invariance of the conformal 


gauge. In the conformal gauge we set 


Imn = Mmn 


by using the 2 coordinate invariances and the 1 scale invariance to fix the 3 compo- 
nents of the symmetric tensor gmn. The coordinate part of this gauge is essentially 
the temporal gauge gom = Nom, just as for the particle (—goo = v? = 1). Also as 
for the particle, this gauge can’t be fixed everywhere (see also subsections HIA5 and 
IIIC2), but the equation of motion from the metric is implied everywhere by imposing 
it at just the boundaries in T. In this gauge the equations of motion for X are just 


the 2D Klein-Gordon equation, which is easy to solve in 2D lightcone coordinates: 


BWOX=0 = A SApalt+o)+ AQ =o) 


(We have used 7 + o in place of o* for later convenience.) The constraints are then 
Pe 


yr (Xx (4) = 0. This directly relates to the form of 2D conformal transformations, 


which are infinite-dimensional in D=2: 
is =Wde'de = «=jfao'), o =f4o") 


The constraints are the generators of these conformal transformations. (As described 
in subsection IIIA5, the constraints generate the gauge transformations; the global 


transformations are those that preserve the temporal gauge.) 


For the lightcone gauge, we again fix the (spacetime) +-components of the vari- 
ables, and solve the +-components of the equations of motion (found by varying the 
—-components). Looking at the equations of motion first, using the first-order form 
of the action, 

0= —— + ee eg A (9) tal 


=> (-g)7/?g@mn = (A+B) (€mpA? eng A? — BmBn); A™=PT™, Bm = OmXt 


(as seen, e.g., by using €m,A", Bm as a basis), and 


6s d 
=, eee hee paces d pt? = 
0 X= ) = ral o 0 
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which identifies [do P*° as the conserved momentum pt, up to a factor of 27a! 
(since p is really the coefficient of ¢ in the action, where X(o) = x + ...). Similarly, 
dS/dg"” determines P~™, and thus X~. 


We then choose as our main set of gauge conditions 


Xt=kr PYrr=k 


’ 


for some constant k, which explicitly determines 7, and determines o up to a function 
of 7: An equivalent way to define the lightcone o in terms of an arbitrary spacelike 
coordinate o’ is 


c= co | da’ P(e") 
0 


which identifies o as the amount of momentum pt between that value of 0 and a = 0 
(at fixed 7). We thus have that the length of the string (the range of o, not the 
physical length) is 

a fo Pt = 2na'ptk 


We then need to fix the location of g = 0 as some function o’(T): Since in this gauge 
P= 0 
so P*+ is also a function of just 7, we further fix the gauge for o by choosing 
PH =0 (9) Gm = Tn 


Thus the lightcone gauge is a special case of the conformal gauge, after also fixing 
scale gauge g = —1. For the open string, this almost fixes o’(T) at o = 0, which we 


can take as one boundary: The boundary condition for X* is now 
0O=n-0X*t~ no 


since in this (and any conformal) gauge 0,X ~ NmnP". Thus the normal to the 
boundary must be in the o direction, so the boundary is at constant 0. This means 


we have one constant left to fix: 
o = 0 at one boundary (open string) 


This invariance was left because all our previous gauge conditions preserved global a 
translation. Unfortunately, there is no corresponding convenient gauge choice for the 
closed string, so there we leave just this one invariance. In summary, our complete 


set of lightcone gauge conditions is now: 


gauge : Xt = kr, PEC ake, o = 0 at one boundary (open string) 
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The lightcone action is now, in Hamiltonian form, 


a fe {ey + | a [-XP; + $(a'P? + a!" X??)| } 
Exercise XIB1.1 
Analyze the classical mechanics of the string by approximating o by a set of 
discrete points, so X'(0) > Xnii — Xn, etc. Show that the string then acts 
as a bunch of particles connected by springs, and find all the usual spring 
properties: tension, speed of wave propagation, etc. (Note: You may need 


some lightcone modifications of nonrelativistic variables. ) 


The only distinction between open and closed strings is the boundary condition 
(since closed strings by definition have no boundary). For closed strings we have only 


periodicity in o (by definition of “closed”), while for open strings we have 
KO) = 2 (al =0 


One consequence, as we just saw, is that closed strings have one residual gauge invari- 
ance in the lightcone gauge. As described in subsection XIA3, these two strings can 
be made to resemble each other more closely by extending the open string to twice 


its length, defining X for negative a by 
X(t, =o) = X(t, o) 


This is the known as the “method of images”: X (7, —c) is identified with its mirror 
image in the 7 axis, X(7,-—-a). Then the two strings both satisfy periodic boundary 


conditions, while the open string has this one additional condition. We also choose 


1 (open) 7 


k = 2ka'p*, = 
decd " e (closed) K 


so the length of the closed string is 27, while the open string has original length 7 that 
has now been doubled to match the closed string. Our choice of “phase” in relating X 
for positive and negative o for the open string automatically enforces the boundary 
condition X’(7,0) = 0 at one end of the string, while the condition X'(7, 7) = 0 at 
the other end is implied in the same way by the closed string “boundary condition” 
of periodicity, which can be written as X (7,7) = X(7,—7). The picture is then that 
the open string is a closed string that has collapsed on itself, so that for half of the 
range of a X doubles back over the path it covered for the other half. 


Because ¢ has a finite range, X can always be expanded in Fourier modes in that 


variable; the boundary conditions slightly restrict the form of this expansion. We saw 
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that the equations of motion, being second-order in 7-derivatives, gave two modes 
for each initial state: a left-handed one and a right-handed one. We need to be a 
bit more precise about the “zero-modes” (modes independent of 7): We can separate 


them out as 


2ra! ! 
X(7,0)=a2+ 7 PT + Sarre) + yr =z), [4 ¥a)=0 


where Y contains only nonzero-modes. (The normalization of p, conjugate to x, comes 
from the —a-p term in the Lagrangian.) Then x represents the “center of mass” of the 


string, and p its total momentum. Note that this implies X(+) aren’t quite periodic: 


X(4)(o + 2m) = X(4)(0) + 2rKa'p 


Now the periodicity boundary conditions shared by open and closed strings imply 


Y(4) (a =F 2m) a Y(4) (c) 
while the extra boundary condition for the open string implies 


Yuay(0) = Yy(0) = ¥(0) 


allowing us to drop the subscript in that case. Thus, the closed string has twice as 
many modes as the open, except for the nonperiodic part, corresponding to the total 
momentum and average position. This is related to the interpretation that the open 
string is a closed string with its two halves occupying the same path. This doubling 


also shows up in the constraints: For the closed string we have Pos, while for the 


open string we can consider just Poss since P?(c) = = P ‘.)(-@). In the lightcone 


gauge we solve these constraints for X~, by integrating 


“5 £9 +; 2 o . 2 
0 = Pay ~ X(ay = (Xia) — BX) ~ Way + eV 20°9) 


Exercise XIB1.2 
Rederive the solution to the boundary conditions for the open string without 
using X(T, —o0) = X(r,¢) (and periodicity): The string, as originally, extends 


between boundaries at 0 and 7. 


This separation of zero-modes from nonzero-modes also allows us to find the spin 


and mass of the string: In any conformal gauge, 


H 
— 
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1 [7 do ; do ; 
ab _ [a,b] ab __ [a y)] ab __ a b 
J a -a/ wo = ay fey Ya) 


(using the Hermitian form of the Lorentz generators, for classical purposes), where 


for the open string we can replace 


wT Pris 
do _ | a, Ys) 2 Y 
9 27 


i Oe 


For the lightcone gauge we then have the gauge condition to determine X* and 


Virasoro constraints to determine X~: 


‘ : _ 1 a 
Ya) = 0, Kap + ote YQ) ~ QKalpt Gz = pee Yin) 


Exercise XIB1.3 
Consider gauge fixing in the temporal gauge, replacing X* with X°. The 
classical interpretation is now simpler, since tT and X° can now be identified 
with the usual time. Everything is similar except that the Virasoro constraints 


can’t be solved (e.g., for X') in general without square roots. 


a Show that some 3D solutions (2 space, 1 time) for the open string are given 


by, for p' = p? = 0, 


for nonzero integer n. (Without loss of generality, we can choose c real and 


positive.) Find the mass (energy) and spin as 


Find X explicitly, and show it describes an “n-fold spinning rod”. 


b Show that the above solution can be generalized to closed strings by using 
two such Y’s, and fixing the relative magnitude of the two c’s. Consider the 
special cases where n_ = +n,. Find the explicit masses, spins, and X’s, and 
show that one describes another n-fold spinning rod, while the other is an 


“n-fold oscillating ring” . 
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2. Quantum mechanics 


The more interesting features of the string don’t appear until quantization. In 
particular, we can already see at the free level the discrete mass spectrum character- 


istic of Regge theory, or of bound states in general. 


Canonical quantization is simplest in the lightcone gauge. As for particles, canon- 
ical quantization is convenient only in mechanics (first quantization), not field theory 
(second quantization). As can be seen from the lightcone action, the Hamiltonian 
is part of the constraints: For the spinless particle, we had only the constraint 
p? + m? = 0, which became E = H in the lightcone gauge Xt = ptr after iden- 
tifying the lightcone “energy” E = ptp~ and its Hamiltonian H = $(p? + m?). (See 
subsection HIB2.) The string Hamiltonian can be rewritten conveniently in terms of 
P. Since the closed string is effectively just a doubling of the open string, we treat 


the open string first. The Hamiltonian is simply 


where P = P,,). Since we have chosen X* = 2a/p+r, we have E = 2a'ptp~. 
(+) 


To identify the individual particle states, we Fourier expand the worldsheet vari- 
ables in o. As for the particle, we can work at 7 = 0, since all the dynamics is 
contained in the constraints. Equivalently, from the nonrelativistic view of the light- 
cone formalism, we can work in the Schrodinger picture where the 7 dependence is in 
the wave function instead of the operators. We expand as 


Pa=) ae", Gg=V2ap, Ga=a,! 


n=—-CO 


The canonical commutation relations for P and X are 
[Pi(o1), X;(02)] = —271d(a2 — 01) di; 


as the direct generalization of the usual |p, q] = —i. (The 27 is from our normalization 
do /2r.) From the definition of P, we then have 


[P:(o1), Pj(o2)] = —27id' (a2 — 01) 63 


We can then decompose this into modes by multiplying by e”1+"72) and inte- 


grating, where 
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We then find 
[Gavas Qin] = MOm-+n,00ij 


as well as the usual [p;,7;] = —id;;, and thus can relate the modes to the usual 


apo 
harmonic oscillator creation and annihilation operators: 


Ci — Vnan, Op = Vnan' => ae An] = Onn 


for positive n. After normal ordering, we find for the Hamiltonian 
CO 
H= a''p? N= Qo, N= S > nain' ain 
n=1 


> H-E=d'(p?+M”), M? = a’'(N — a9) 


for some constant @p, which we introduce as a “renormalization constant” for remov- 


ing an infinity in normal ordering. 


From the expression for the mass in terms of the number operator N, we see 
that the nth oscillator aj,‘ raises the mass-squared of the ground state |0) by n 
(and similarly for multiple applications of these oscillators). For any given mass, the 
highest-spin state is the symmetric, traceless tensor part of multiple a;,'’s acting on 


|0): This describes the leading Regge trajectory, with spins 
j=a'M*+a9 


Let’s look first at the first excited level, obtained by acting on the scalar ground 
state |0) with the lowest-mass oscillators a;;'. Clearly this describes a (lightcone) 
transverse vector, with no Stiickelberg scalar for describing a massive vector. (l.e., it 
has only D—2 components, not the D—1 necessary for a massive vector.) Thus this 


state describes a massless vector, so 


aj=1 


The ground state is then a scalar tachyon with M? = —a’"!. 


For any given level 
past the first excited level, one can check explicitly that the states coming from the 
various oscillators include the necessary Stiickelberg fields. For example, at the second 
excited level, ay'ajit contains a traceless, symmetric tensor and a scalar (coming from 
the trace), while aj’ is a vector; they combine to describe a massive tensor. The 
proof that this works to all mass levels is closure of the Poincaré algebra quantum 
mechanically: The only nontrivial commutator is [J~‘', J~4] = 0, since only J~ is 
higher than quadratic (cubic) in oscillators (from the form of X~ and P~ after solving 


constraints), so normal-ordering ambiguities lead to more than just constant terms. 
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For reasons to be explained in chapter XII, the algebra(ic computations) in calculating 
this commutator in any first-quantized theory for the lightcone gauge is the same 
as the first-quantized BRST algebra for general gauges. Of course, the proof of 
closure is already the same in principle because both algebras are a consequence of 
the constraints, the conformal algebra. Thus, any anomaly must show up in the 


conformal algebra itself, which will be considered in subsection XIB4. 


Exercise XIB2.1 


Check the third excited level massive representations. 


The closed string works similarly to the open, but with two sets of harmonic 
oscillators, and with 


Pi) = P(-) = 3P 


In that case we find 


M* = 2a!"(Nay + N-) — 2) 


where N(~) and N_) are the number operators for the two independent sets of os- 
cillators. In the lightcone gauge the closed string has the residual gauge invariance 
generated by [ do X’-6/6X; this gives the residual constraint 


The closed-string states are thus the direct product of two open-string states of the 
same mass: For example, the ground state is a scalar tachyon with M? = —2a’~!, 
while the first excited states are massless ones from the product of two vectors — a 
scalar, an antisymmetric tensor, and asymmetric, traceless tensor. The leading Regge 


trajectory consists of states created with equal numbers of ain(4)'’s and ai_)'’s, with 
j =ta'M? +2 
In summary, the leading trajectory for open or closed string is given by 


(open) 


(closed) 


Covariant quantization of the string can be performed in several ways: One is to 
use the OSp methods of chapter XII, as applied to the Lorentz generators derived 
from the lightcone analysis (see subsection XIIB8). Another is to use the usual BRST 
of subsection VIA, as applied to gravity in subsection IXB1, treating the mechanics 
of the string as a 2D field theory. For the case of the conformal gauge, introducing 


ghosts C™ corresponding to the gauge parameters, and antighosts B,,, paired with 
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the Lagrange multipliers of the gauge conditions (Nakanishi-Lautrup fields), we find 
the ghost action 


aa _ qo, 5 gq" -_ Vir”) — Le: 


> be = Bae VOC +B Vic, 


where in the last step we have introduced a background “zweibein” for applications 
such as the background field gauge, or geometries that do not admit the conformal 
gauge globally, and flattened the indices on the ghosts so the tracelessness of B 
(which follows from that of 6,/—gg™") can be solved explicitly. (The Weyl scale 
transformation of the gauge-fixing condition for the conformal gauge does not involve 


derivatives, so the Wey] scale ghosts are just algebraic.) 


In covariant gauges, for purposes of calculating more complicated quantities than 
the spectrum, it will prove useful to work directly in terms of 2D (conformal) field 
theory in the position space of the worldsheet, rather than Fourier transforming to a 
mode expansion. As usual, we Wick rotate to Euclidean space, after which we work 
in terms of complex coordinates 

p=T+It0 
Then the usual spin $ fields on shell are directly functions of just p or p, while the 
usual scalars break up into a sum of both (see subsection VIIB5). In particular, for 


X we can write 

X = / $[X1(p) + Xa(P)] 
where we have introduced normalization consistent with earlier parts of the book, 
since the action we used in for a scalar @ is a’/2 times what we used for X in 
subsection XIA3. 

As described above, for the open string we always combine the two chiralities, such 
as W,(p) and Wp(p), on the interval o € [0,7], into a single chirality on o € [—7,7] 
as 

bo) = A(o)bx1(c) + O(—0)bn(-<) 
so the open string then looks like a closed string with one handedness. We can then 


use the same function w(p), evaluated in different halves of the complex plane, to give 
wy and wr, which are both defined in only one half: 


In particular, the previous separation of X into its chiral halves becomes 


X= Vf F1X(p) + X()] 
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for the open string, while X;, and Xp remain independent for the closed string. On the 
boundary of the open string, where we place vertex operators for external open-string 
states, we have simply X = V2a/X. 


As in 2D electrostatic problems, it’s often convenient to use conformal invariance 
to transform various surfaces with various topologies and boundary conditions to 
ones with boundaries whose shapes are simple enough (e.g., straight lines) to use 
techniques like the method of images to solve for propagators. (Otherwise, we are 
restricted to looking at just short-distance behavior, which is independent of the 
boundaries.) For now we consider just the simplest examples, the strip (open string) 
and cylinder (closed string). In general (e.g., interactions in the lightcone gauge) we 
would need to consider strings of various lengths; for now we simplify matters by 
assuming the length of the string has been scaled to 7 for the open string and 27 for 
the closed, for reasons explained previously. (For lightcone treatment of interacting 
strings, the string length is proportional to p*, so length is “conserved” when they 


split or join at the ends.) 


Z 


We then map the open string to the upper-half plane, or the closed string to the 
whole plane, via 


z=eP 


Since p = T +10, any closed string at fixed 7 is mapped to a circle, while any open 
string is mapped to the upper half of a circle. The two boundaries of the open string 
at o = 0 and z are then mapped to the positive and negative real axis, while the ends 
of either string at t = —oo and +00 are mapped to the points z = 0 and oo. The 
fields will be singular at z = 0 and oo, so they should really be thought of as singular 


limits of circles. (z = oo isn’t really much of a point anyway.) 


If we use “Osterwalder-Schrader reality”, determining reality in Euclidean space 


also by Wick rotation from Minkowski, then p is pure imaginary (o is real, 7 is 
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now imaginary), so under Wick roation of Minkowki complex conjugation, we find 
z—1/z. As usual, this switches tT = too as z = 0 — z= oo. The reality condition 


on real 2D fields is therefore 


For the closed string the scalar propagator is then as we found in subsection 
VIIB5, while for the open string we use an image in the lower-half plane to give the 
appropriate Neumann boundary conditions (vanishing of normal derivative) on the 
real axis: 


GaosealZ, 2) = —In(|z — 2’ |?) 
Gent; z') = Gaveeal 2; z) ot Gasca, Z') = —In(|z _ ay) _ In(|z _ Z|) 


where the normalization is 
(6¢)=G or (X X)=td'G 


for the conformal scalar of subsection VIIB5 and the X of the string. 


Note that propagators do not transform simply under conformal transformations, 
because of conformal-weight factors (see subsection XIB4). However, the vacuum 
itself is not invariant under conformal transformations. The vacuum |0) we use is the 
one natural for identifying the string with the entire complex plane (and the one that 
comes from the path integral in these coordinates, using “1” for the vacuum wave 
functional). It is with respect to that vacuum that the propagators take the simple 


form we have used above. 


3. Commutators 


Since for the most part we will be interested in free fields, quantization will be 
described most easily by the path-integral method. Although 2D field theory already 


looks quite different from the 1D field theory of particle mechanics, free 2D massless 


fields depend on only one of the two lightcone coordinates o* (or are the sum of 
two such terms), and hence 2D conformal field theory is similar to 1D massive field 
theory. Consequently some of the features of particle mechanics or nonrelativistic 
field theory, such as the commutator, can still be useful and 2D Lorentz covariant. 
In particle mechanics, the (equal-time) commutator is evaluated by path-integral 
methods as 


((A, B](t)) = lim(A(t + €)B(t) — B(t + 6) A(t)) 


EeE— 
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(and similarly for the anticommutator), since A and B are treated as classical func- 


tions when evaluating the path integral 
(y= f De fes 


where now “( )” refers not to just the vacuum expectation value, but incorporates 
arbitrary initial and final states through the boundary conditions, or explicit wave 
functions in the path integral (see subsection VA1, and XIB6 below). In general, this 
definition of ( ) actually gives the time-ordered expectation value, as follows from 
the derivation of subsection VA1: The e’s were introduced to enforce the appropriate 
ordering. For the rest of this subsection time ordering will be implicit in expectation 


values. 


In the Hamiltonian formalism for ordinary quantum mechanics, we have the term 
— f{ dt gp in S, which defines qg and p as canonically conjugate, and gives the propa- 
gator 
id, sie(t — t') = 6(t-#’) 


As a result, 


lim|[q(t + €)p(t) — p(t + €)q(t)] = zie(e) — gte(—e) = 


Ee 


Similar results can be obtained in Lagrangian approaches, where for fields satisfying 


second-order differential equations we use the propagator (see subsection VIIIC5) 
adit —t'| = s(t—t%) 


Exercise XIB3.1 
Find the equal-time commutator of a massive scalar field with its time deriva- 
tive, in arbitrary dimensions, from the propagator. (Hint: Start with the form 


expressed in terms of time and spatial momentum, and Fourier transform.) 


For the analogous result in (Wick-rotated) D=2 we consider again the fermionic 
L = dy. The € regulator used in subsection VIIB5 is not needed; now the € we use 
for time ordering plays that role. Using the fermionic propagator 1/(z — 2’) (from 
subsection VIIB5), and the identity (see exercise VA3.1) 
i) 
u—ie urtie 


= 27i0(u) 
we find 


es & i 1 
lim (B(r +6, 0)B(1, 0") ~ Br 0) +60] = ae aH 
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= 27d(0 — 0’) 


Although we have taken the commutator at equal 7, in 2D conformal field theory 
the fields’ time-dependence is given by their depending on just z or just Z, and since 
even interacting string calculations in terms of these “free” (with respect to first- 
quantization) 2D fields factorizes into two separate calculations for the left-handed 
fields and for the right-handed fields, we generally treat just z or just Z as the only 


argument. 


In arbitrary dimensions, we can similarly evaluate commutators of conserved 


“charges” with local operators as (under suitable boundary conditions) 
Q = [as J = fers j™ 


+ [Q,Als)} = ¢ d5!, "(2")A(e) 


where the last integral is over a boundary enclosing x. (Conservation implies the 
result is independent of the boundary. In the 1D case such a boundary consists 
of just 2 points.) The graded commutator [| } (commutator or anticommutator, as 


appropriate) is automatic in the path integral because of the classical grading of the 


L, ®-9-6 


Since such “surface” integrals in D=2 are basically contour integrals (see exercise 


variables. 


IIA1.2c), working with functions of just z and not Z, this becomes 
dz dz! ! 
= ~ — => = >? — A 
Q= PSI > 1A@}= § FI )AW 


where the contour that gives the commutator encircles the z where A is evaluated. 
Then we can avoid taking a limit, since the contour just picks up the simple pole at 
z' = z. (For the cases of interest, the current has both divergence and curl vanishing, 


so the time component of the current is the sum of chiral and antichiral parts.) 


In fact, this is the only contour that is relevant: We need not define the contour 
for Q as an abstract charge, only a contour for how it acts on an operator. States 
themselves are defined by operators: For example, we can pick an arbitrary point in 


the complex plane as t = —oo (because of conformal invariance); z = 0 is conventional. 
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A state, defined as an operator acting on the vacuum, is then represented as that 
operator at z = 0. (This is true also in path-integral language, where the operator 
is included in the path integral as the wave function of that state.) Later times are 
circles about this origin, so Q is integrated about such a circle, representing its action 
on that state. But we see that this is the same as evaluating the commutator of Q 
with its corresponding operator, which is consistent if Q vanishes on the vacuum. 
This picture is natural in Euclidean space, where there is no time in the usual sense, 
and will prove particularly useful later for string theory, where the worldsheet time 


is unrelated to physical time. 


The contour integral definition of commutators is also convenient to avoid wor- 


rying about time ordering and limiting procedures. For example, 


=>  {v(2), o(2)} = 2nid(2! — 2) 


directly picking up the contribution from the pole in z’ — z (see exercise VA3.1b), 
where ¢ is an arbitrary (classical) function, and the 6 function in z is understood at 
equal times as 2776(z' — z) — 276(0’ — oc). More generally, derivatives of 6 functions 


follow from more singular terms: 


4. Conformal transformations 


Conformal invariance in D=2 is infinite dimensional, and looks like two copies of 


general coordinate invariance in D=1: 
ay a2 =hiezdedz. = 2 =7@),. he2) (es \(o7) 


Effectively, we can treat dz and dz as independent (except for complex conjugation) 
1D line elements. From the above examples of the scalar and spinor (and we know 
is also true for the reparametrization ghosts) we see that fields generally depend (on 
shell) on just z or just Z (“chiral” and “antichiral”, or “holomorphic” and “antiholo- 
morphic”, or “left-handed” and “right-handed” ), except for the scalar, which can be 
written as a sum of two such terms. (There is some ambiguity on what to do with 
the zero-modes, which we’ll have to deal with separately. But 0¢ and 0¢ don’t suffer 


from this problem.) 
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As usual the conformal transformation of a field depends on its scale weight w 
(which is the same as the 2D engineering dimension): If we think of a field y as a 1D 


tensor, we can write 

(d2')"xX(wy (2) = (dz) xX (wy (2) 
for a field with w covariant 1D indices, but since the 1D index takes 1 value, we can 
trivially generalize to w that is negative and non-integer. We can also write this as 


he ey ony 
wil?) = (F) X(w)(2) 


For example, for the scalar and spinor we have 


w(0¢)=1, w)=4 


Of course, as in general relativity, not everything is a tensor: For a scalar ¢, ¢ and 
O¢ are, but OO¢ isn’t. Unfortunately, after eliminating the 2D metric, we have no 
1D metric left to define 1D “covariant derivatives”, so we’ll have to live with such 


noncovariant objects. 


Now that we know how conformal transformations act on fields and how to eval- 
uate commutators easily, we can write expressions for conformal generators, i.e., the 
energy-momentum tensor T. For the infinitesimal form z’ = z — A(z) of the above 


conformal transformations, working with just one chirality for convenience, we need 


Qnt 


_ if ZN2/)T(2")s xiup(2)| = MC2)(Oxt0)(2) + (PVD (2) 


We first consider the case where y has an action linear in derivatives: 


L= Nae OXan 


for either fermionic or bosonic y. (In the fermionic case and with w = 4 we can 
identify y with x and include an extra normalization factor of 5:) By conformal 
invariance, the canonical conjugate y of y has weight 1 — w, so both the left-handed 
(z) weights (from y and Y) and right-handed (2) ones (from 0) sum to 1 (for invariance 


under dz dz integration). Using the propagator 
1 


x(z)x(2') © ea 


it’s easy to see that to give the above transformation law TJ must consist of terms 


with one derivative, 
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where the coefficient of the second term is obvious from the special cases w = 0, 5, 1. 

Originally this Lagrangian came from a 2D coordinate invariant one with a 2D 
metric (zweibein), and the conformal weight w came from the Lorentz weight (“spin”, 
but in D=2 the Lorentz group is Abelian: SO(1,1)=GL(1)), the number of “+” minus 
“_” indices, as used in the 2D spinor notation of subsections VIIB5 and VIIIA7. The 
Lorentz connection was the derivative of the metric, giving the total derivative term 
in T, which was the variation of the action with respect to the metric. (In Minkowski 
space this chiral part of Tis 7,1; the tracelessness condition T’.. = 0 follows from 


conformal invariance.) 


This includes the classical string action 


Pi2Pt Pe eer 


as well as the conformal-gauge ghost action 


Beg V Co BVA, 


(The factor of e~! can be absorbed into the fields by a local scale transformation; it’s 
irrelevant for defining T+...) 
Exercise XIB4.1 


In general, the commutators we write for conformal transformations come 


from the singular (pole) parts of products of unintegrated quantities. 


a Show that in the special case of an operator with weight w = 1, we can write 


—F x1 (z) 


T(z)xqay(2’) ¥ — 
This is the case where ¢ x is conformally invariant. 
b Another interesting case is w = 2: Show 


T(2)xey(2) ® ——aaleinl2) + x2") 


c Show that T itself has weight 2, using the Jacobi identity. 
d Show directly that T has weight 2 by evaluating TT using the explicit expres- 


sion in terms of x’s. Consider only the semiclassical (1-propagator) terms. 


(2-propagator terms will be considered below.) 


Closure of the BRST algebra, or lightcone Poincaré algebra, is nontrivial because 
of the infinite summations over oscillators, or equivalently because of integration over 
the two-dimensional “momentum” (which is quantized as mode number in the o di- 


rection because of the finite extent of 7). As usual, BRST invariance is equivalent to 
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gauge invariance, and we can check for anomalies in the usual way, now applied to the 
2D “field theory” corresponding to the mechanics of the string. The anomaly calcu- 
lations are similar to those applied to the Schwinger model in subsection VHIJA7. (In 
particular, see exercise VIIIA7.1.) The gauge invariances in question are coordinate 
invariance and local scale invariance, whose preservation is the vanishing of the diver- 
gence and trace of the energy-momentum tensor. As seen from our analysis for the 
Schwinger model, this implies that the quantum corrections to the energy-momentum 
tensor must themselves vanish when the external fields are restricted to gravity only. 
The calculation again involves one-loop propagator corrections; performed in posi- 
tion space, we get the product of two propagators between the same two points, with 


various numbers of derivatives acting on either end of either propagator. 


OF 
We start with the case of fields with first-order field equations, whose T’ we rewrite 
as 
T = (1— w)xdx — w(0X)x 
The 2-propagator terms correspond to 1-loop propagator corrections for the 2D gravi- 
ton, which couples to 7’. These should vanish for the 2D metric to consistently remain 


gauged away, or in other words, for conformal invariance to be preserved at the 1-loop 


level. (There are no higher loops for this calculation because the theory is free.) 


We get 2 kinds of terms, depending on whether both derivatives from the 2 
T’s hit the same propagator, or one hits each. In the former case one gets a term 
proportional to z~100z~! = 2274, in the latter (@z~1)? = z~*. (The z~* means that 
the contribution to the commutator is proportional to 0°6(z — z’).) The anomaly is 


thus proportional to 
2[2w(w — 1)] + [w? + (ww — 1)"] = 6(w — 5)’ - 5 


with an extra minus sign if the fields were fermionic (from the usual reordering). For 

example, a complex fermion with w = 5 gives a contribution +3, while a pair of real 

chiral bosons with weights w = 0,1 give a contribution of 1. Thus, as we saw from 

bosonization, a complex fermion with w = $ gives the same contribution as a single 

real chiral boson with weight w = 0. 

In fact, we can use the above expression directly to obtain 7’ for the second-order 
action for a single real chiral boson: For the complex case, take either of 
w=0 > T=xXdxy; x=, X=Od 7 

a. 7 = T= (0¢)(0¢) 
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where the substitution is justified by the relation between the @ and y propagators. 


The anomaly for the complex case is then just twice that for the real one. 


For the bosonic string, we have D real scalars, and fermionic ghosts with w = 


—1,2. The anomaly is then 
bosonic : 4D—13=0 => .D=26 


The superstring can be treated with variables that are the worldsheet supersym- 


metrization of those of the bosonic string (“Ramond-Neveu-Schwarz formalism”). 


Thus, there are D real fermions (w = 5) as supersymmetry partners of the D X’s 
(w = 0), as well as bosonic ghosts (w = —3, 2) as partners of the fermionic ones 


(w = —1,2). (The weights —1 and —% correspond to those of the gauge parame- 
ters of coordinate and supersymmetry transformations, while the weights 2 and 3 


correspond to those of the worldsheet metric and gravitino.) Thus 


RNS: 45D+45D-13+%=0 => D=10 


Consequently, we have two conditions on known strings that make them unsuit- 
able for describing mesons: unphysical intercept ao for the leading Regge trajectory 


(massless particles) and unphysical spacetime dimension D. 


5. Triality 


For the case of a Lagrangian quadratic in derivatives only bosons ¢ are interesting. 
Then we can consider adding a term to the Lagrangian proportional to Ré, where R 
is the (2D) curvature, proportional to the second derivative of the metric. We then 
find 

T = 3(0¢)* — node 


where yz is the coefficient of the curvature term. The above transformation law is 


then modified except for 4 = 0 (where w = 0) to 


_ if 2 N(eVP(), 6(2)] = A)\(08)(2) + uO) 


277% 


This inhomogeneity (and familiarity with bosonization: see subsection VIIB5) 


leads us to consider the fields 


774 XI. STRINGS 


for some constant a. Since the yz term is linear, it doesn’t affect the propagator of ¢, 


so we have 

x(2)x(2!) & (2-2) 
and we still need a = 1 to get canonically conjugate fermions (with the usual anti- 
commutation relations). If we now evaluate the conformal transformation of x using 


the T' of @, we find, using 


tagp(z 1a tagp(z 
(06)(2')e? ——e o(z) 
Z HZ 


that it transforms covariantly, with 


w(x) = 5a” + ia, w(X) = $a? — ipa 


where the a? term comes from a 2-propagator term, and is thus a (“anomalous”) 
quantum correction that would not be seen from a Poisson bracket. In particular 
a=1 => w(x)=5tin, w(X)=o-th 


The last result should have been expected from the expression of T’ for y, since the 
w — 3 term multiplies 0(¥x) = —i00¢ from our earlier study of bosonization (where 


we already found the w = 4 term). 


Exercise XIB5.1 


The zya term in w is classical, since it comes from a single propagator: 


a Derive the Lagrangian for a scalar with R term by starting with the nonlocal 
term R(1/O)R and applying a local Weyl] scale transformation, introducing 


the scalar as the compensator. 


b Find the classical scale weight of e’*® from its local scale transformation. 
(Hint: In deriving the local scale transformation in part a, an exponential 


will be needed, so that e® transforms homogeneously. ) 
These results generalize easily to general linear exponentials for multiple scalars 
eiaig 
Including an indefinite metric 7,;, 
T = 3mj(99')(09’) — 1006" 
will modify all the above expressions in an obvious way, including the metric (or its 


inverse) where needed to contract indices of the same kind (both up or both down). 


For example, a? — a-a and ya — p-a in the expression for w, and 


eit 9(2) pia-9(2') py (z _ g eee de ene) 
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The most important use of such exponentials, outside of bosonization, is for ex- 
ternal fields (or “vertex operators”): The conformal generators (energy-momentum 
tensor) have conformal weight 2; requiring that background fields preserve conformal 
invariance implies that such vertex operators must have conformal weight 1 and be 


local on the worldsheet (see exercise XIB4.1): 


Pe)Pe) » yl) +P) 


A 


T(z) =T(z)+W(z) => Whasw=1, W(z)W(2') 0 
= W(2z) = 2nid(z — 2)W(z) 


where we have assumed the conformal anomaly cancels (or ignored its contribution), 


and solved for closure of the algebra perturbatively in the background. 


If we write a background spacetime field @(X(z)) as a Fourier transform, then we 
see that its conformal weight is proportional to k?, the square of the external momen- 
tum. Hence a vertex operator consisting of just a scalar field produces the tachyonic 
ground state (w = sk? = 1). Excited states are created by products of derivatives 
of X times fields (with spacetime Lorentz indices contracted); the derivatives add to 
the conformal weight, forcing k? to decrease in compensation, resulting in massless 
and massive (m? > 0) states. For example, 0X already has w = 1, so the vector 
multiplying it must have k? = 0. 

Exercise XIB5.2 

Show this without Fourier transformation: Evaluate the conformal transfor- 
mation of an arbitrary function (not functional; this is a particle field, not a 
string field) (X(z)), using T = 1(QX)?. Show that transforms covariantly, 
with the number w replaced by —O (with respect to x ) acting on ©. 


Bosonization can also be applied to representations of groups (Lorentz or inter- 
nal). In particular, to obtain the correct anticommutation relations for a fermion e’*? 
and its conjugate e~*“? we require a? = 1; to get the usual conformal weights, we 
require js = 0. 

The Ramond-Neveu-Schwarz formulation of superstring theory uses fermions that 
are a representation of SO(D) (especially D=10) by simply carrying a D-valued (vec- 
tor) index. The simplest way to obtain these fermions from bosonization for even 
D is to define a (D/2)-vector ¢’ with n;; = 6;;. Then we find for our SO(D)-vector 
fermion 

vector : a= (1,0,0,..50),. (Op21, 05.20), 
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in a (complex) null basis. This construction follows that for the Dirac matrices in 
subsection XC1: Each scalar corresponds to a two-dimensional subspace of SO(D), 
and each component of a (D/2)-vector a; is the corresponding eigenvalue of the two- 
dimensional spin. Klein factors should be included to make fermions using different 


scalars anticommute (see subsection [A2). 


From the above we can also find the generators of SO(D): The raising and lowering 
operators come as for y-matrices from multiplying fermions from different pairs, then 
integrating. This gives expressions like the above, but with different a’s (sums of 
2 different a’s of the vector). For the Cartan subalgebra, taking products from the 
same pairs and subtracting the divergent constant gives 0¢°. But then from the 
product relation for 0¢ times e’? we see that a; are the weights of the representation 


(eigenvalues of the Cartan subalgebra). 


Then we can try to make SO(D) spinors the same way: We try the weights (also 
obvious from two-dimensional spinors as the square root of two-dimensional vectors, 


and how spinors come from direct products of two-dimensional spinors) 
spinors : a; = (+5,45,...,45) 


where all the +’s are independent, except that their product is +1 for one Weyl spinor 
and —1 for the other. (The conventions are slightly different from subsection XC1: 


Now we use a representation where o3 is diagonal, and y_, is chosen as the last ¥.) 


However, these spinors can have the usual commutation relations and conformal 
weights only for D=8. This is significant for two reasons: (1) D=8 is the number of 
physical (i.e., transverse) fermions for the RNS superstring, and (2) SO(8) is the only 
simple Lie group with the property of “triality”, a symmetry between the vector and 
two spinor representations. In fact, if we start out by defining the basis for one of the 
spinors with the same a we used above to define the vector, and rewrite the above a’s 
for the vector and other spinor in terms of that new basis, we see that we have just 


permuted the 3 a’s. 


This relation between (fermionic) vectors and spinors is important for superstrings 
because it relates in several ways to supersymmetry. For example, we know that 
supersymmetry representations must have equal numbers of (physical) bosons and 
fermions; in D=10, the vector and (Weyl) spinor both have 8. Since bosonization 
allows bosons to be defined from fermions and vice versa, triality allows the SO(8) 
vector fermion to be defined from either SO(8) spinor fermion, and vice versa. So, at 
least in the lightcone gauge, we can translate anything in the RNS formalism to the 


“Green-Schwarz” formalism, which uses a spinor fermion. This allows supersymmetry 
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(or at least the lightcone version) to be manifest, since superspace is defined by adding 
a spinor fermion to the usual spacetime coordinates. For RNS there is also “Gliozzi- 
Scherk-Olive projection” to get a supersymmetric spectrum: Keeping only integer, 
not $-integer (mass)? for the bosons of the NS string (see exercise XIIC1.1), and using 
a chiral ground state for the fermions of the R string. For GS, this means using states 


created just by one Weyl spinor field and not the other. 


Similar triality constructions apply to lower dimensions by taking into account 
supersymmetry, which also relates a vector to a spinor. In D=6, simple supersym- 
metry has an internal SU(2) (R) symmetry: Thus, there is a triality relating this 


SU(2) to the two SU(2)’s of the lightcone’s SO(4). In terms of these, the vector is 


i a: 


the (4, 3,0) representation, while the spinors are ($,0,4) and (0, 4,4). The resulting 


292) 
operators are given by 


ay = (t35,+75,0), ag =(4 


1 = 1 
4,44 0,+ ag = (0,t4,+ 


ty fA EE) 
VP ya) V2 


To relate to the SO(8) results we use the vector to define the basis, yielding 


ay = (£1,0,0), (0, £1, 0) 


= as= (bbe) (-hpedy) ay = hth) (td 
This also follows directly from the SO(8) result by dropping the third and fourth 
scalars for the vector, and using only (1//2x) their sum for the spinors. (Le., it 
represents only internal symmetry.) For D=4 the construction is even simpler: Besides 
the SO(2)=U(1) of the lightcone, there is a second U(1) for R symmetry. In terms of 
the complex plane defined by these two quantum numbers, there is an obvious triality 


for the three cube roots of 1; thus 


ay = +(1,0), as = +(-§, 


ae 


i ag = +(-1,-%8) 


which again also follows from SO(8), now combining its last 3 scalars. 


Exercise XIB5.3 


We now extend the analogy to the construction of subsection XC1: 


a Show for general SO(2n), in analogy to the Dirac y’s, that the (integral of) 
products of two vector fermions, antisymmetrized in the vector indices, act in 
the same way as the group generators, by examining their commutators with 


each other and with the vector and spinor operators. 


b Show for the triality cases that the (anti)commutator of two representations 


yields the third (supersymmetry). 
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These constructions can be generalized from the lightcone to manifest Lorentz 
covariance by adding equal numbers of scalars of positive and negative metric (at 
least one of each): Their contributions to the spinors’ operator product (power of 
z) then cancel, preserving the anticommutation relations. One of the extra scalars 
of positive metric yields the two “longitudinal” spacetime directions to complete the 
SO(D—2) vector and spinor representations to SO(D—1,1). The rest of the scalars 
come from “ghosts”. Note that the spacetime metric is unrelated to the metric for the 
scalars: The Minkowski spacetime metric comes from Wick rotation of the scalars, as 
applied in subsection XC2 to the construction of subsection XC1 for Dirac spinors. 
For either Euclidean or Minkowski spacetime the basis is null; the only difference is 


in reality. 


6. Trees 


Unlike particles, Feynman diagrams for strings can be treated by first-quantized 
methods for arbitrary loops. The basic idea is that interacting strings are just strings 
with nontrivial geometries: For example, while an open-string propagator can be 
described by a rectangle, an open-string tree graph can be described by a rectangle 
that has parallel slits cut from two opposite ends of the rectangle part-way into the 
interior; this describes initial strings that join and split at their ends (interactions). 
This is the lightcone picture of interactions, where conservation of total pt means 
conservation of the sum of the lengths of the strings. This corresponds to the choice 
k = 1 in the language of subsection XIB1, since the worldsheet coordinates must be 


chosen consistently over the whole worldsheet: 


Xt=7r => 1=2na'pt 


More general conformal gauges are defined by conformal transformations of this con- 
figuration: For example, the boundary of this slit rectangle can be transformed to a 
single straight line by the usual methods of complex analysis, so the worldsheet be- 
comes simply a half-plane. (For the infinite rectangle, relevant for asymptotic states, 
the transformation is p = >> ptln(z — Z,).) Then even the geometry is irrelevant; 
all that matters is the topology, which tells how many loops the diagram has (see 
subsection XIA2). 


B. QUANTIZATION 779 


First-quantized path integrals are then the easiest way to calculate arbitrary 
S-matrix elements in string theory. However, the calculations still can be quite com- 
plicated (as expected from a theory with an infinite number of one-particle states), 
so we first consider just the tree-level scattering of ground states, which is sufficient 
to illustrate the qualitative features. We can start from a gauge where the string is 
an infinite strip (a rectangle of infinite length but finite width), with all but two of 
the external states associated with points on one side of the strip, the remaining two 
states being at the ends at infinity. This is equivalent to a picture of a propagator in 
an external field, with all but two of the external states associated with the external 
field. Similar calculations are possible for particles, but give only a single graph; for 
strings this gives the only graph, since different cyclic orderings are related by con- 
formal transformations. (We are restricted to cyclic orderings by group theory, as for 
the 1/N expansion for particles.) This method can be used in either the lightcone 


gauge or Lorentz covariant conformal gauge. 


We begin by adapting the results of subsection VIIIC5 for the particle to the 
string: We expand the propagator 1/(H») — V), restricting all external states to 


tachyons, where now 
Ho=3x(p? +m), Via ge *© 


for vertices at one end of the string o = 0 for convenience. (All other choices are 
equivalent by duality. When acting on the tachyon ground state we'll see this gives 
the same result as using the usual string zero-mode x after an appropriate limiting 


procedure.) The amplitude is again 


T1—- oo 


Ay=g? lim ee a (0|Vin (7) Vn_1(7N_1).-.Vo(72)Vi(71)|0) 


TN—1=0 
TN +00 


We can again evaluate this operator by a path integral. This time normal ordering 
removes infinite terms coming from connecting a vertex to itself with a Green func- 
tion, and we again keep only terms with Green functions connecting different points. 
(Normal ordering can also be treated in a more careful way by taking the vertices 
to correspond to finite-width strings, as they would in the lightcone approach, and 
taking the limit where their widths vanish.) 


Now we normalize the Green function as 
(X X) = ha’G 


where we have inserted the a’ because of the difference in normalization of the ac- 


tion. We have also used a 7 for the string normalized to a’ = 3, since the “time”- 


development for the string (in normalization where o goes from 0 to (2)z) is really 
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given by a’(p? + M?) = (2a’)5(p? + M?). Thus there is an extra factor of 2a’ for 
each propagator to restore the original generic 1/$(p? + M?). Then the amplitude is 
simply 


71 —— 00 


Ay aga? Tim ee Gene"/ i dN exp 16 do ki Rj Q(F, 75) 


7N-1=0 nce, t<j 
FN +00 TiTi41 


We next make the conformal transformation 
z=e 


used earlier. Vertex operators in general have conformal weight 1 so they can be 
integrated: dzV(z) = dz'V'(z'). This is true for the tachyon vertex e** 


because the tachyon has m? = —2 in units a’ = 4, so the weight $k? = 1. Under this 


on shell 


transformation we find V(7) — zV(z) for the 3 unintegrated vertices Vij, Vy—1, and 


Vy. The amplitude is then 


pe aes i<j 
ZN —0O 
Inserting the propagator 
Gopen(2, 2’) = —In(\z = 2'|?) — In(\z - 2'?) 


where all z’s are real, we find (again dropping the divergent terms from propagators 


connecting a vertex to itself) 


2S2i41 


z4 70 _— 
zN—1=1 t<Jj 
ZN 7 CO 


We can now simply evaluate at z,; = 0 (and zy_; = 1), and use momentum conser- 
vation (and again the ground-state mass-shell condition k? = 1/a’ for the final state) 


to cancel the dependence on zy: 


Ay = 8 *(20/)"-* / a2 TT (gj - a) 


S%41 1<i<j<N-1 


The simplest case is the four-point function (it was how string theory began), 


i 
Ag = g(20’) | deze 
0 
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in terms of the Mandelstam variables 
s = —(ky +k)’, t = —(ki + ka)? 
where the tachyon lies on the Regge trajectory 
a(s) =a's+ao, aj=1 


which we recognize as the Beta function (see subsection VITA2) 


=< = Bl—a(s),-a(?)] = — 


2a'g 

Similar methods can be used for calculating closed string diagrams: There the 
interactions (vertices) are inside the string, instead of on the boundaries, so the 
integrals are over both z and Z, and thus not ordered. Since the closed-string Hilbert 
space is the direct product of two open-string Hilbert spaces (except for momentum), 
for left- and right-handed modes, the closed-string vertices are the product of 2 such 
open-string vertex operators. Also, the exponential of the Green function is the 
product of a function of z times a function of Z (which are no longer equal). Thus 
the z-z integrands are products of two open-string integrands (one for z and one for 


Z), but with p — $p. 


For example, for the closed-string 4-point tachyon amplitude, 


d?z ee oe 
Ascioua~ f (\z|?) za(s) 11 — 2)?) za(t)—-1 


but now 


a(s) = a's +2 


in terms of the same a’ used for the open string. The integral is evaluated in the same 
way as Feynman diagrams (not surprisingly, since those are full of Beta functions, 
too): Consider a 2D, massless, 1-loop propagator correction, where each of the internal 
propagators itself has quantum corrections, and so is some power of p?. This has the 
same form as above if we interpret z and 1—z as the momenta of these 2 propagators. 


We therefore use the usual Schwinger parameterizations 


7 —— ay fer pete 


for f = |z|? or |1 — 2|?, introduce a scaling parameter \ = 7, + 72, 7 = Aa; to get 


Feynman parameters a;, etc. We eventually find 
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which is dual between all 3 channels, where we have introduced wu via the identity 


16 


a’ 


a(m7)=0 => s+t+u=4r’?=- 


to manifest this symmetry. 


If we expand the integrand of the Beta function for the open-string amplitude in 
powers of z, we find (see exercise XIB6.1 below) 


(oe) 


1 = 5 a) + Malt) + = fol) + 1 


=< J! J —a(s) 


which shows Regge behavior: The leading trajectory comes from the t7 term, while 
“daughter” trajectories come from t7~"/(J — a) = t?’/[J' —(a—n)]. Since the 
amplitude is symmetric in s and t, we can also write this as a sum over ¢-channel 


poles: This is duality. 


Exercise XIB6.1 
Derive the pole structure of the 4-point string amplitude by Taylor expanding 


the integrand of the Beta function in z. 


If we go back to the Schwinger parameter via z = e~7 and expand 1 — e~” in 7, 
we find the Regge limit (see exercise XIB6.2 below) 


where higher orders in the 7 expansion give contributions from daughter trajectories. 
The same result can be obtained by using the Stirling approximation for the I”’s, or 
by applying the Sommerfeld-Watson transform on the pole expansion above. 
Exercise XIB6.2 
Use the Stirling approximation (see exercise VIIC2.1) to derive the Regge limit 
of the 4-point string amplitude. Show that the same result can be obtained 
directly from the integral (Beta function) representation of the amplitude, 
where the main contribution comes from z near 1. (Hint: see exercise XIA1.2.) 
Exercise XIB6.3 
Show the Regge limit can be obtained from the Sommerfeld- Watson transform 


of subsection XIA1. (Hint: The Beta function is a sum of Regge trajectories. ) 


Another high-energy expansion is at fixed angle (t/s), as used in perturbative 


QCD (large “transverse” energies). Again using the Stirling approximation, 


lim A, ae ef (cos A)a(s) 
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cos9x1+ a fr 25, (=) + alt (=) 
8 —s t —s u 
This Gaussian behavior in momenta is not the power behavior expected (at lowest 
order) from asymptotic freedom. The interpretation is that the “partons” that make 
up these strings have Gaussian propagators instead of the usual 1/(p? + m?) (see 
subsection XIA7). Similar effects show up if we heat up strings past the “Hagedorn 
temperature” (see subsection XIC2), where they break up into a parton plasma: A 
QCD string would then show quarks and gluons, whereas known strings show almost 


no degrees of freedom, since Gaussian propagators have no poles. 


m 


m+1 


m+n 


Similar analyses can be made for the higher-point functions: For example, we 
can show the poles are in the same places not only from the derivation (splitting 
up the path integral) but looking for momentum-space singularities directly in the 
amplitudes. Consider some number n+ 1 of consecutive (because of ordering) vertex 
insertion points approaching each other: The worldsheet picture is that these external 
lines are much closer to each other than the rest of the diagram, so relatively they 
have been stretched away, emphasizing a propagator connecting that bunch to the 


rest, carrying the sum of their momenta. The z integration diverges in that region as 
0 


where we have “scaled” all n of the converging z; — z;’s by the same variable z to 
treat that region with a single integral, and the sum is over 2 < 7 for those n+ 1 k’s. 


Then using the on-shell condition for the tachyons (in units 2a’ = 1) 


m+n 2 
Ge = (S | =2 So &-kj+2(n+1) 


mi<jomt+n 


we have 
1 


Lot 
Aw | dz g7Smimtn/2-2 
0 Sm,m+n =p 2 


giving the tachyon pole. Corrections to this result come from mutiplying the integrand 


by a polynomial in z, yielding higher-mass poles. 
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There are several ambiguities that might have arisen in calculating the normal- 
ization of these amplitudes, but they all would have only 2 effects: (1) There could 
have been an extra factor of (constant). This could come from normalization of 
the vertex operator, due to normal-ordering prescription (from dropping an infinite 
constant) or coupling-constant normalization. It could also come from choosing the 
2D-dimensionful constant jy in the propagator I/n(y|z|). (In the above amplitudes, 
momentum conservation translates such k;-k; terms into k? terms.) (2) There could 
also have been an overall N-independent constant. This ambiguity could arise from 
normalization of any of the integration measures. Another source is possible ambi- 
guity in definition of the 3 vertices without z integration as compared to those with. 
These 2 effects are identical to those in ordinary field theory: Consider a Lagrangian 


(with normalization appropriate to matrix fields) 
L = ~2716(01 - m?)o ~ 2%gh¢ 


Then the “wave-function” normalization Z combined with the definition of the cou- 
pling g are equivalent to the ambiguities we have described. The normalization of 
g is arbitrary, since it must eventually be fixed only by experiment. The value of 
Z is also a convention, but must be consistent with our convention for calculating 
probabilities. This is not manifest in general conformal field theory methods, since 
only the amplitudes are defined. One way to fix it is to use a pole expansion and 
compare the contribution of that particular particle to the corresponding result of 
particle field theory. For example, we can look at the 3-tachyon amplitude A3, which 


is just a coupling constant: 
As = 9 
and at the contribution to the 4-tachyon amplitude A, from a tachyon in the ¢ channel, 
te? 2 


A 209 g 
* =a(t) — Ef(ky + ky)? + m?] 


2 
| 


Both of these agree with the results obtained from the field theory Lagrangian above 
by the usual Feynman diagrams, with the same g and with Z = 1, as a result of our 


derivation of the string result from assembling vertices and propagators. 


There are also generalizations to strings with worldsheet fermions; the main dif- 


ferences are supersymmetry and D=10 (instead of 26). 
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7. Ghosts 


We did not really prove conformal invariance for the above calculations of scat- 
tering amplitudes, because 3 of the vertices were not integrated over. In particular, 
if we would have picked different z’s for those 3 vertices, the result would have dif- 
fered (though only by a constant in momenta, a function of those z’s). This is the 
problem of determining the right integration measure for the z’s. There are several 
solutions, but the easiest is to use the ghosts. The full reason for the importance 
of the ghosts when calculating amplitudes for states that apparently don’t involve 
ghosts will become clearer later when we examine string field theory (see especially 
subsection XIIB8). For now we note that it is natural to consider the BRST opera- 
tor when considering the definition of physical states, since it defines them in a way 
that is independent of gauge. For this discussion we’ll restrict ourselves to the open 


bosonic string. 


Since the mode expansion was used to define the vacuum, isolate zero-modes, etc., 
we need to look at how this is affected by the conformal transformation that took us 


from the strip to the complex plane: 
Xo) = > Xe” = xy Z= > mer 


This implies a certain definition of the vacuum: T = —oo is now z = 0, so states are 


created by operators that are nonsingular as z — 0. We therefore have 
Yin /0 =U. jor 27> 0 


which differs from the naive x,,|0) = 0 when w 4 0. For example, for the open, 


bosonic string we apply it to 0X, C, and B, to find 
(Gn, D, Cn+1) bn—2)|0) =0 


In relation to the tachyonic vacuum |t), describing an off-shell, zero-momentum 
tachyon, satisfying 
(Gris Ds Cn, Bins bo) |t) = 0 
we thus have 
|0) = b_3|t), \t) = c,|0) 


which describes an on-shell, zero-momentum Yang-Mills ghost (see subsection XIIB8). 


Exercise XIB7.1 
What state is c_,|t)? Check the mass level and ghost number, and that it 
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couples to the right state in the propagator 1/H,. (Ignore the mode co, which 


gets special treatment, as for the particle.) 


One result of this change in vacuum is that the number operator N —1 is replaced 
with N in H because the 1 is canceled by the new normal ordering. (Now 6_, is 
treated as an annihilation operator, c, as creation.) This is clear from the fact that 
|0) corresponds to a massless state, so the p? term (as well as the normal-ordered 
oscillators) in N vanishes on it. 

We know how to get physical fields from the tachyonic vacuum |t), by hitting it 
with physical oscillators. But the tachyon itself is created from the vacuum |0) by 
hitting it with c,. In conformal field theory langauge, since C' has conformal weight 
w=-l, 

C(z) = an =. teeter tee? +... 
=— |= lim C(z)|0) 
This state is off shell, and so is not in the BRST cohomology. An on-shell tachyon is 


described by 
it, k) = lim C(zjel*X@ 0), 2 =2 


As we saw earlier, choosing this value of momentum gives the exponential w = 1. We 


can thus write an arbitrary tachyon state in terms of the tachyon field ¢ as 


lim C(z)6(X(z))]0), (A + 2)d(x) = 0 


20 


(Here ¢ is a function of X(z), not a functional of X.) 


Similar remarks apply to excited states: For example, for the vector, using the 
generalization of the operator OX used for the particle (see subsection VIIIC5) we 
have 

In, k) = lim C(z) x - (OX)(z)e**O\0) k=O 


since OX has conformal weight w = 1. Near z = 0 only the first creation operator in 


OX will contribute when acting on the vacuum: 
(OX)(z) = \ ae =>  lim(0X)(z)|0) = a_,|0) 
z—0 
Thus an arbitrary vector state is 


lim C(z) A(X (z)) -(O8X)(z)|0), OMAg(x) =0 


20 


B. QUANTIZATION 787 


Exercise XIB7.2 
What is the local (in z) operator that, when acting on |0), produces the state 
of exercise XIB7.1? 


More generally, any operator C(z)W(z), which has weight 0 if W has weight 1, 
will create a physical state from the vacuum if ¢ W creates a physical state from 
the tachyonic vacuum. (Taking z — 0 is a conformal transformation, since CW has 
w = 0, and corresponds to choosing a gauge.) Any operator W(z) with w = 1 has 


conformal transformation 


7 f 2 N(e)P(e!), W(2| = (AW) (2) 


271 


so its integral is conformally invariant. If it is ghost free, it is therefore also BRST 


invariant, 
[Q, fW} =0 


(Gauge-invariant operators without ghosts are BRST invariant.) Then CW is also 
BRST invariant: Using {Q,C} = —COC (see subsection IXB1), 


{Q, CW(z)} = {Q, C}W — C[Q, W] = —C(OC)W + Ca(cw) 


= —C(OC)W + C(0C)W + CC(OW) = 0 
Since both ¢ W and CW are BRST invariant, and the vacuum is, they can be used 
to construct a BRST invariant amplitude. 


Massless vertices for the closed string are similarly the product of left and right 


open-string vertices (as are arbitrary vertex operators): 
V = Kimn(OX™) (OX) * 


(The ghost structure is a little more complicated because of the zero-mode for Ty—Tp.) 


The fact that CW’s are needed at all (and not just ¢ W’s) is related to the fact 


that the choice of vacuum does not completely fix conformal invariance: 
14/0) = 19|0) = 24/0) = 0 


since T has w = 2. (The third is satisfied because cob_,|0) = 0.) These 3 operators 
generate Sp(2). (This unbroken invariance is related to the gauge invariance of the 
massless vector.) Fixing this invariance in the path integral requires an extra ghost- 


dependent factor. 
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One way to determine this extra dependence is to note that for the scalar particle 
there is also a ghost, the analog of co for the string. The integration measure is then 


simply | dco, so we have 


particle : [ee @qg=1 + @e=1 


where ( )c means we neglect X. The analog of the scalar for the string is the tachyon, 
so 

1 = (t\co|t)o = (0|c_icoc1|0)c 
which are the ghosts of Sp(2). We then find 


C(z) =... +ce12* +e2+aq +... 
=> (C(u)C(z2)C(z3))o = —(41 — 22) (22 — 23) (23 — 21) 


We can now solve the problem of conformal invariance using BRST: ¢W and CW 
are both BRST invariant, so for general (tree) amplitudes we use 3 CW’s and the rest 
¢W’s. This eliminates 3 integrals and introduces 3 C’s, which produce the above 
factor. Later, when we consider string field theory, we'll see that the reason for this 
counting is that an n-point graph has n — 3 propagators, each of which has a factor 


of ¢ B, so all but 3 of the true vertex operators CW are converted into W’s. 


The (CCC) factor replaces the factor 22, in the open-string tachyon amplitude we 
produced previously by other arguments, and agrees with it for the previous choices 


Zz, = 0, zy_-1 = 1, zn — cw. We therefore replace the previous result with 


An = —(21 — Zn_-1)(Zn-1 — 2) (Zn — 21) ee TG — 2;)Re*s 
i<j 
To show this result agrees with the previous, we use the Sp(2) invariance of the vacuum 
(and conformal invariance of everything else). The (“zeroth-quantized” ) transforma- 
tion on z from the above 3 Virasoro operators are found by noting that on expressions 


@ with conformal weight 0 (no “spin” piece to the transformation), 


‘a ¢ 2 ip > —|T,, d] = 2"*'d¢ 

274 
The finite transformations generated from the infinitesimal ones (n = 0,+1) are 
az +b 
czt+d’ 


(e.g., by examining the infinitesimal case and checking the group property). Then 


ad— bec =1 


Z: > 


we use the fact that such a transformation can be used to transform any 3 points to 
fixed values. In particular, 


&2— &£| ZN-1 — 2N 


Zz— 
Z— ZN ZN-1 — 21 
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transforms z; — 0, zy_1 - 1, zn — o. 


If you know some group theory, you might recognize this representation of Sp(2) 
as the coset space Sp(2)/GL(1), ie., 2-component vectors with an antisymmetric 


metric, with scale transformations as a gauge invariance: 


c= (4), C90 =9G—-OG, =A 


After gauging ¢2 = 1 (for all ¢’s), we then have simply 


— od — / 
8 > ¢€-C=2-2 


But now an Sp(2) (=SL(2)) transformation will change the gauge, so we need to 


supplement it with a compensating scale transformation: 
Zz a Ob Zz az +b 
— — 
1 c ad 1 cz+d 


ol ean _ (aaa) 


czt+d\cz+d 1 


as above. From this the result on z — 2’ is clear: It is invariant under the original 


Sp(2), so its transformation comes from just the scalings, 


z—2! 


oe Cz t dye’ +.) 


as is easily confirmed by performing the compensated Sp(2) transformation. For the 
same reason, dz also has a simple transformation law, so Sp(2) invariance of the 
amplitude is easy to check explicitly, using momentum conservation and the mass- 


shell condition k? = 2. 
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Note that in the following, although we sometimes use operator or path-integral 
notation, we never actually calculate by performing explicit oscillator evaluations 
(using, e.g., coherent states) or the infinite-dimensional integrals of the path integral: 
As we did previously for tree amplitudes, we just use general properties of quantum 
theory. Specifically, we use spacetime or worldsheet Feynman diagrams, which are 
just perturbation theory, but can be derived from oscillators, path integrals, or other 
methods. 


We first examine the planar loop, with external tachyons. There are 3 parts to 
the calculation: 
1) 2D Green function 
2) volume element (or “integration measure” ) 
3) partition function 
The first two we encountered for tree graphs; the last (really a part of the volume 


element, but a new one) comes from summing over the infinite number of states of 


the string that circle around the loop. 


1. Partition function 


As for trees, we generalize the results of subsection VIIIC5 for particles to strings. 
That method allows the volume element to be determined unambiguously. Often 
symmetry arguments are used to determine the volume element, but that has 4 major 


drawbacks: 
1) Sometimes symmetry is not enough even to determine functional dependence. 
2) Symmetry will never determine overall constants, since constants are invariant. 


3) In particular, BRST symmetry only guarantees gauge independence of the result. 


If BRST is used, a separate evaluation in a unitary gauge is needed. 


4) Anomalies can violate symmetries, so a symmetry-independent evaluation is need- 
ed to check for anomalies. (I have even seen symmetry arguments use to conclude 
certain asymmetric contributions must have vanishing coefficient, when in fact 


nonvanishing anomalous contributions were found from direct evauation.) 


In string theory, because of duality, 1-loop graphs can always be represented 
without external trees. Thus, unlike the particle case, we will automatically find the 


S-matrix, and not the effective action, just as for trees the single graph we considered 
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automatically gave the complete S-matrix element for strings only. We now find the 


amplitude 


AY) = / dT d\~'F V(T)exp | —4a! > ky - kyG(%i, 7) 

—00 S$-TSTSTi41 <0 i<j 
(There is also an overall normalization of (2a’g)‘, in analogy to subsection XIB6, 
which we’ll ignore from now on, and we choose units a’ = 4. The trace is normalized 
to agree with that for a single particle, together with a sum over particles, as explained 


below. This “T’” should not be confused with the 2D energy-momentum tensor.) 


The volume element for the string can be factorized into a sum over states (now 
called “partition function”) and an integral over the momentum of each state as for 
the particle: 

Viera) = i as » ee 
states 
To evaluate the sum we use a’/M? = N — 1 and evaluate the sum as the product of 
independent summations over the oscillators of each of the D — 2 = 24 transverse 


directions: Again making the conformal transformation 


and using units 2a’ = 1, we have 


D-2 
os f2 <2 7 
y ener) ) 
D=1 


states 


where N’ = S> na,'ay, is the contribution to the number operator of any one di- 
mension. This sum is itself the product of contributions of any one oscillator to N’: 
For each oscillator we get a sum of terms, one each from each excitation level of that 


oscillator. For the nth oscillator, 


excitations j=0 


The final result for the volume element is then, 


Va (In w) Pw '[F(w)P?, fw) = TJ a-w") 


n=1 


Putting the pieces together, we have (with w < z; < zj41 < 1) 


iA 
= ) exp - s kz ky G( 2/2 w) 


i<j 
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We have summed over just the transverse oscillators, representing the physical 
states. This can be justified by quantizing the string in a lightcone gauge, where only 
the transverse oscillators (but all components of momenta) appear. For lower-point 
functions, this result can be obtained (assuming Lorentz invariance is preserved) by 
working in a reference frame where all the kt = 0, so none of the x~’s, which are 
quadratic in the transverse oscillators, appear. (By Wick rotation, we can always 
assume pt is a complex combination of any 2 spatial components of momentum, a 
“spacecone gauge”, so no restriction is imposed on p® on shell.) A more general way is 
to use a Lorenz gauge (like the conformal gauge). Then the contributions of the ghost 
oscillators will cancel 2 dimensions of the bosonic ones: For any fermionic oscillator 


d,, remembering that a fermionic state gets a minus sign in a loop (so the trace is 


/ 
y wrdan'dn = w" 


excitations 


really a “supertrace” ), 


where we have summed over the 2 excitations, and )~’ means the fermionic term 
gets a minus sign in the sum. (In functional integral language, fermionic integrals of 
Gaussians give determinants while bosonic ones give inverse determinants.) f(w) is 
thus a partition function for 1 fermion. We have dealt with the ghost zero-modes by 
using the b) = 0 gauge. (Similar arguments can be given at 1 loop using just conformal 
field theory, thus showing as for trees how the ghost contribution preserves conformal 
invariance, but they become more obscure at higher loops. A better understanding 


of this gauge comes from string field theory.) 


Exercise XIIC1.1 
All the 2D fields we have explicitly considered effectively have periodic bound- 
ary conditions: They are expanded over e’”’ for integer n. Consider instead 
a single fermionic field with “antiperiodic” boundary conditions (as in, e.g., 
the NS, or bosonic, sector of the RNS string, which we haven’t studied in de- 


(n+1/2)¢ The masses? for the oscillators now goasn+ 5 


tail), expanded in e’ 
instead of n, with the ground state mass chosen so that the first excited state 


is massless. 
a Find the contribution to the partition function for this field. 


b For GSO projection (for supersymmetry), one looks at only masses? that are 
integer (dropping, e.g., the tachyonic ground state). Find the contribution to 


the partition function for this reduced set of states. 


794 XI. STRINGS 


2. Jacobi Theta functions 


It will prove useful to consider a more general type of partition function, one for 
energy and ghost number, for the ghosts in their fermionic and bosonized versions. 


We use the variables 


(We began with T positive, so0 < w < 1, but for the following manipulations 
Re T > 0, so |w| < 1, is OK, and later we analytically continue.) We define the 


partition function 
Z(w,z) = str(e tT!) = trlw™ (e*z)"] 


where for “energy” H and ghost number J we use the equivalent bosonic/fermionic 


expressions 
(oe) [oe] 
H = hp? + Sonal nan = S- n(claba + Bach) + 2 
n=1 n=1 


et 3|Co, bo] + > obs = bl nee) 
n=1 
(Bosonization works the same way for ghosts as for physical fermions, but with 2’s 
missing, since c and 6 are each real, instead of complex conjugates. The : is because 
the momentum p of the boson is an integer plus 3, since total ghost number is, and 
thus H as defined above for the boson has minimum value 3.) We have defined the 
“supertrace” by including a factor of e~’” into the trace: It gives the usual —1 for 
fermion states as defined for fermion oscillators, which we carry over to the bosonic 


formulation. 


Exercise XIC2.1 
Look at the first excited level (corresponding to massless ghost states of the 
string): 
a Express all the states of this level in terms of fermionic oscillators acting on 


the vacuum. Translate this into bosonized operators. 
b Evaluate H and J for these states, and sum their contribution to Z(w, z). 


In terms of the variables 


p 5 Qriv 
v=, T=-— Z=e"", We 
271 271 ( 
if we express the result in terms of the Jacobi @ function, and the partition function 


of subsection XIC1, 


ane) 
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we find the two equivalent forms for the result 


oe) 


6,(v|r) 59 S- Cy 7 a a: 


n=—-C 


= iw (zM? — 2-2) [7 (1 — w")(1 — zw") (1 — 21”) 


n=1 


from the bosonic and fermionic versions, respectively. (For the bosons, the sum is 
over eigenvalues of p. The factor of f in the definition of @ cancels the oscillator 


contribution in the bosonic case, and is included as a product in the fermion case.) 


From either of the above forms we can easily see the Jacobi @ function satisfies 


the “quasiperiodicity” conditions 
6,(v + 1|7) = —6,(v |r), O(v + 7|r) = —e "476, (v|7) 
As a function of v, it vanishes only at v = 0, up to these periods. It is also odd: 
0(—v|r) = —A,(v|7) 
From the second (product) form we also have 
0, (0|r) = 2nw"/*[f(w)]? 


where the prime means derivative with respect to the first argument. 


We will need transformations of these functions under the subgroup of conformal 


transformations SL(2,Z)=Sp(2,Z), namely 


at +b V 
—}; y—- ——— 
cT +d cr +d 


T= 


(or similarly for T and p) in terms of the SL(2,Z) group element 


ee) 


whose elements are integers and determinant ad—bc is constrained to 1. The simplest 


isa =b=d=1,c=0: From the product form of 6;, 


O,(vjr +1) = e'/49, (v|r) 


The next simplest one is b = —c = 1, a = d = 0: This can be derived by 


>, 7 du e2t|-Au? /2+i(B+n)u] 


n=—CO 


considering 
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This can be evaluated by doing either the sum or the integral first. To evaluate the 


sum first, we use the identity 


S- gu ys d(u—n) 


(This can be checked by multiplying on either side with a function and integrating, 
noting that only the periodic part of the function, with period 1, is picked out, so that 
function can be written as a Fourier sum, and the integral evaluated over [—3, 4].) 
Thus we relate the 2 forms 


See — A-1/2 en ee, ee aes 
In this “Poisson (re)summation formula” we make the replacements 
A= -—iTt, B=v—-4rt+34 


(so the integral is well defined again for Re T > 0). After straightforward algebra 


(replacing also n — n — 1 on the right-hand side), we find 


_ ~) = ei (H?/7+1/4) 71/29, (yr) 
os 


From applying this to 64 we find, for w = exp(2z7ir) and w’ = exp|27i(—1/rT)], the 
Hardy-Ramanujan formula 


The general case can then be found by combining arbitrary multiples of these 2, 


V 
] 
(4 


where ¢ is an eighth root of unity. (We won’t need it, since we’ll use only |6,|? below.) 


yielding 
at +b 
cT+d 


) -_ C(er + d) ete" (er+4)9, (y| 7) 


Hagedorn noticed that the multiplicity of observed hadron states as mass in- 
creased (and other features) was characteristic of a thermodynamic system with max- 
imum temperature around the pion mass. In QCD language this is the temperature 
of the deconfining phase transition, above which hadronic matter is replaced with a 
quark-gluon plasma. Later this behavior was found to follow from strings. However, 
in the string case the number of states above this temperature is found to be less than 
that of an ordinary particle theory. This can be attributed to the fact that a ran- 
dom lattice worldsheet approach to quantization of known string theories describes a 


theory whose partons have Gaussian propagators, without poles. 
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To derive this temperature from the bosonic string, we begin with the Hardy- 
Ramanujan formula above: The counting of states at high “temperature” (mass) is 


then given by looking at w = 1 — €, € — 0, which is w’ — 0: 


20 2 
li ry 4{/— e-™ /6e 
lim f(w) rae 


up to terms which are smaller by powers of € (or worse yet, powers of the above 


exponential factor). The number of states at the nth excited level is then given by 


d 2—D 
MCE poe [f(w)| 
277i ayn) 
with the contour a small circle near the origin. We can evaluate this integral by the 
saddle point approximation for the “action” 
oe) 


S = (D—-2)in f+(n+1)in w = —(D —- 2) = 


+ $ln ( —(n+1)e 
D-2 


> ERT 
6n 


1 
> N(n) ~ ae wn Pt)/4 2a r(D-2)/6 


To pick up this contribution we have widened the circular contour to run through the 


saddle point: Since this point is real (and at |w| < 1 so f is still well defined), the 
contour runs in the imaginary direction for the infinitesimal region near the saddle 
point where it contributes, so this point is a minimum. Using n © a’m?, dn N(n) = 


dm p(m), we have 


Dae amine. 1 6 


p(m) ~m m= 5 Boye 


The pion mass is about 27 smaller than 1/Va’ (in terms of the hadronic string tension 


a’), so this is in the right ballpark. 


3. Green function 


We begin our determination of the Green function by analogy with the particle in 
subsection VIIIC5. Again we have only a constant zero-mode, since a constant is the 
only periodic function (since we will need to consider only tori) that is a homogeneous 


solution to the wave equation (as seen by Fourier expansion): 
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in terms of the area of the worldsheet. 


Unlike the tree case, g = 0 and o = 7 correspond to the 2 different boundaries 
of the orientable loop. If all the vertices are on 1 boundary (say, o = 0), we have a 
planar loop, otherwise nonplanar. To include both boundaries, we need the complex 
variable 

p=Tt+ite 

in terms of which the torus coming from doubling the open-string surface is a periodic 
rectangle, with corners at tia, T +i. (For the closed string, this will be distorted 
to a parallelogram, to take arbitrary twists in o into account.) Again as for trees, the 
open-string Green function follows from that of the closed string using image charges 


for reflection about the boundary: 


Gonen| Pi p') = Gassea( P; p') =f Criaseal 4; p) 
Unfortunately, the Green function for the closed string does not quite separate into 
holomorphic and antiholomorphic parts, but this is not a complication for low- 


excitation vertices. 


The closed-string Green function can be found in various ways: For example, 
using the method of images, an infinite sum is obtained, which can be recognized as an 
expression of a Jacobi 6 function. Alternatively, a Jacobi 6 function can be recognized 
as the solution to the wave equation with the correct periodicity conditions. To see 
that 6; is useful for the Green function, we first note that it is analytic, and for small 
p, 91 ~ p, so a —In|6,|? term gives the —In|p|? term of the complex plane, yielding the 
correct 6 function term in its wave equation from its nonanalytic behavior at p = 0. 
Then we see that it has periodicity under p — p+ 277 and almost under p — p+ T; 
the latter is fixed by an extra term, similar to that for the particle, (Re p)*/(Re T) 
(generalizing to complex T for later application to unoriented strings or twisting 
of closed strings), which contributes to the wave equation the extra term inversely 
proportional to the area of the torus 27Re T: With our previous normalization, 


Ary 9 _ 1 
00G = rs (c) set | 


The result for the closed Green function is then 


», (2_|ZE\| , (Re 0? 
"\ Oni | Qni 


Re T 
which appears in the amplitude as G(p; — p;,T’), where we have added a function 


G(p, T) =—In 


+H(T) 


H(T) that is constant with respect to z, but depends on the geometry (T). 
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In principle constants should not contribute, because of conservation of momen- 
tum. But we sum only over i # Jj, dropping i = 7 terms by normal ordering. 
Normal ordering is not conformally invariant, so we add back the constant so the 
short-distance behavior, and thus normal-ordering, is the same. In the tree case 
the geometry was trivial, so the true constant was fixed as a wave-function/coupling 
normalization. But now the function is nontrivial, and our normalization must also 
be consistent with the tree case. This “boundary condition” (actually, there are no 
boundaries for the closed string) can be imposed by requiring that “constant” be 
fixed in the short-distance limit. In principle, the Green function should always look 
the same at short distances, and not be affected by boundaries or topology. For 2 


vertices on the same boundary, 
lim G(p,T) = —In(|p|”) + O(p) 
where the term constant in p must be canceled by H, or 


lim eC? = |p| + O(") 


where the coefficient of |p| must be canceled. Then we find 


Pp =f 1 / 
0,0) (4 ss) x, (0 


277% 
The final result for the closed Green function is thus 


2 2 


AT) =n =In 


p=0 


—T 
277% 


2 
IG, ( == 2 
G(p,T) = —In 2001 ( seal Sei) (Re p) 


0%,(0| 55) Re T 


This result can also be obtained from the tree result (Green function for the cylinder), 
which is already periodic in p — p+ 277, by using an infinite sum to make it so under 


p—p+T: This results in the infinite product form of 6; above. 
Exercise XIC3.1 


Do this sum: 


a First find the Green function for the cylinder from that for the plane by 
z =e’. Then add a homogeneous zero-mode solution to get an expression in 
terms of just p — p’. (The zero-mode contribution needs separate evaluation 


for the reasons given above.) 


b Now make it periodic by summing over p — p+ nT for n = —oo,..., 00. For 
each n # 0 you need to fix the short-distance behavior as above. Finally, add 
(Re p)?/(Re T) to fix the zero-mode as above. 
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Using an identity for Jacobi 6 functions from subsection XIC2 we find, writing 


G(p, T) = G(v|r) in terms of the arguments of 6), 


Vv 
o(-y 


which states that e~°/? transforms with weight 1 under SL(2,Z) transformations of rT. 


atT+b 
cT+d 


) = G(v|r) + Injer +d)? 


In proving this identity one needs to cancel the phase factor from the 6; transformation 
with the contribution from the non-@ part of G; the following identities are then useful: 
First, from the result for SL(2,C), 


azt+b azyat+b © Zy— 22 = OTD Im 
ez; +d cxatd (cz+d)(cz+d) er+d_ |er+di? 


Then, from the imaginary part of the vector identity 


0= iViVi Vey =2 Im(ViV;) Vi + cyc. 


choosing 
it _ V 
Aare Oh Bara 
multiplying by |er + d|?/Im 7, and using the previous result, we find 
(Im)?  (Imv/? i 
Im ~  Imt era 


When performing this transformation we will also need for the volume element, 


using the same identity, in terms of w(7) and w’(7’), 7 = (at’ + b)/(cr' +d), 
w(lf(w)]}" = (er' + d)?u'[f(w')" 
(The relation for f itself, without the 24th power, has a 24th root of unity, which is 


conveniently eliminated in this form that appears in the volume element.) 


To analyze singularities in the open string, we’ll need to transform variables in 
the expression given in subsection XIC1 for the amplitude: 


N-1 
Ay = fF StreoP (in wy Pe | Q) emp |— 4D bi: kyGol2)/25 %) 


i<j 


we 


where effectively the open-string Green function Go = 2G in terms of the closed-string 


one G. We start with the change of variables 


y! hy OF SED 
WT) ord 
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In the general case, the Green function we start with will not necessarily be G(v|r), 
so the SL(2) transformation of G given above will not always be the same as the one 


just given to change variables: 


o( 
cT+d 


The second argument of G will also be that of f (as w = e?""7), so this same transfor- 


a7+b 
+d 


) = cto) + inter +a? 


mation will be used on f. Performing these procedures on the amplitude, using the 
above identities for G and f, and also the transformations for the measures 


dz dz’ 


dw du’ 
wo ra 


(cr + d)~?, (cr +d)! 


Ww w' 


we get for D = 26 an expression for the amplitude of the same form as the above, 
but with 


z,wz',w; GoGo); (-In w)-8 — (—-2mi7)-Bler! + dN a + diX-? 


4. Open 


There are various types of singularities that occur in open string diagrams, all of 
which are expected from Feynman diagrams, as long as we take topology into account 
(and satisfy the usual conditions for dimension and string intercept): 


1) poles, from external trees 

2) cuts, from internal 2-particle states 

3) external line divergences, from the loop sitting there 

4) closed-string poles, from recognizing the open-string loop as a cylinder 


(The closed-string pole goes into the vacuum if the diagram is planar, i-e., if all 
external states sit on the same boundary.) 


The easiest poles to find are the open-string ones, since they are in the same 


place as in the string trees (see subsection XIB6): When some number n + 1 of 
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consecutive insertion points approach each other, their Green function looks like the 
one for trees. The worldsheet picture is that these external lines are stretched away 
into a tree, emphasizing a propagator connecting that tree to a 1-loop graph. The p 
integration diverges in that region in the same way as for trees, and the calculation 


is the same, giving the same pole structure. 


The most interesting poles are the closed-string ones, since they appear neither 
in the trees nor in ordinary Feynman diagrams for particles (though higher-derivative 
modifications of ordinary field theories can produce them.) We saw in the particle 
case the usual UV divergence coming from the 7 integration near T' = 0, so we now 
examine w = e~’ & 1 for the string. The worldsheet picture is that the annulus 
is very short in the 7 (periodic) direction, but still 7 in the o direction, so it looks 
like a narrow cylinder, emphasizing the poles propagating along the cylinder: closed- 
string poles. For the planar graph, this closed string goes into the vacuum (zero 
momentum), but for the nonplanar ones there are states connected at each end, so 


we can see the momentum dependence. 


We begin by making some changes of variables. The first is the same as for the 
particle case, which we already evaluated: Separate the Schwinger parameters into a 


scaling parameter and Feynman parameters 
pi = —Ta; 


Next, in the language of the closed-string surface (found from doubling the open- 
string one), we want to switch the 2 directions of periodicity. We want to use the 
same (closed-string) Green function (periods 27 and 7), but noting that it is only 
the ratio of the 2 periods that is invariant under a scale transformation (a conformal 
transformation that doesn’t change its shape), the effect of this switch is 
i id 
— > —: — 
2 ii af 


Since now we replace the periodicities 
poprean pop 


with 
p — pl + 2ni, popt+T’ 


we also define 
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These imply 


277 1 p! Qyp 2 ! yal 
r= = - ——; use (2n)*/T = e2") /In a gies Qriv’ [7 


=a ge 
In the closed string case this transformation is a symmetry, but in the open-string 


case it replaces our point of view (o « 7) from an open-string loop to a closed string 
propagator. Thus the divergence at w = 1 (T = 0) is at w’ = 0. 


In terms of the discussion of the subsection XIC3, this means 


oe (° ’) (: ’) (° ') 
V,T=V,T, =|. ~-J= 
c d cc a —1 0 


/ 


edgy 
=> (—In w)~? — [-2mir(er’ +d) 8 (=< = 7) Sa a 


The 27’s can be attributed to our normalization of momentum integration, while the 


i’s are because, although w’ is still real, z’ is now a phase (p’ is imaginary), and the 
—1 is because the limits of integration for w’ have switched from w (0 < 1). The 
elimination of the /n w’s is an indication of the replacement of the 2-open-string cuts 


with closed-string poles. Looking at just the w’ dependence, we find 


1 dw’ 
re / ” P(w) 


W 


for some P that can be Taylor expanded in w’. (The most convenient expansion for 
this result is the product form of 0,/0{.) There are thus 2 divergences at w’ = 0, 


coming from the first 2 terms in the expansion of P. 


The nonplanar case can be obtained by the same method: The only difference is 
that when V; and V; are on opposite boundaries, Im(p;—p;) = tim. (Also, the p’s on 


each boundary are ordered separately.) We thus only need to replace for those G’s: 


poptin: z3-z, vovth, vari, pao’thl’, 2a zu 
(p, which was real, gets an imaginary part while p’, which was imaginary, gets a real 
part.) The only effects on e~°/? for w’ near 0 (besides the form of P(w’)) are from 


the factors 
— a = 1) 2 U 
Zf/2 7! 1/2 w' ae e (Re p’)?/2T ws 


The resulting extra contribution, again writing i = (J, J’) for the 2 boundaries, comes 


from the exponent 


So kr kp = (Sh) =» 


LI 
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for w’—'/8. The singular integral in w’ is thus now 
1 ~ 
cae | dw’ w' 8/82 P(w') 
0 


This generates the usual closed-string poles at sa's = 2(n — 1) = —2,0,2,.... For 
D # 26, there would be extra In w’ factors generating cuts as for w — 0, but now 
the cuts would be associated with closed strings (color singlets) instead of open, and 


thus be inconsistent with duality (which implies single-closed-string states). 


Exercise XIC4.1 
Compare this analysis of generation of poles to the corresponding “stringy” 


higher-derivative particle loop of subsection VIIIC5. 


The interpretation of the singularities of the planar graph is now clear: They 
represent a special case of the nonplanar one, where s = 0 because there is only the 
vacuum at the end of the closed-string propagator, representing scalar fields getting 


vacuum values. If we transform back to the usual Schwinger parameters, 


1 oe) foe) ore) foe) 
A Re dw’ wp! 3/8-2 cw” _ cn ff dT'e—T (-1-8/8) _ 
| 2c = 22% j IR 


n=0 


We can always make the integral converge by analytic continuation from s < 8(n—1). 
This always works in Euclidean space, except for tachyons (n = 0). But for the planar 
case s = 0: We can ignore the leading divergence in A (f dw'/w”), evaluating it as 
above by comparison with the nonplanar s 4 0. But the next-to-leading divergence 
({ dw’/w") remains, coming from the dilaton pole. In both cases these divergences 


are recognized as due to perturbing about the wrong vacuum. 

The nonorientable (“occidental”?) loop (Mobius strip) is also easy to get from 
the planar one. Note the interpretation of the open string as the closed string with 
reflection about the real axis in the p plane. The Mobius strip is like the planar graph, 
but after a period of T in the real direction there is a half-twist (flip). But because 
of the reflection, a shift by im is the same as this twist. Thus unlike the nonplanar 
(oriented) case, where we replaced (sometimes) p — p+ im, we now replace (always) 


ToTt+in => tor-} 


+ vow 


in the Green function. (In other words, we use the same expression as before for Gopen 
in terms of Gaosea, but with Im T = 7, so what we write as T below is really Re T.) 
This makes it periodic instead for p — p+ Re T + iz, while for p — p+ Re T we 


instead get a flip. We do the same for the partition function f: In operator language, 
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the expression e~'+'7)" (in terms of the number operator N) performs this flip on 
the “initial” states used to define the trace. 


To look at the singularity near w = 1, we again use the transformation 


but now this has the effect on G 


We therefore use 


#=54h & Y=29, 7 =2(27-1) 


The amplitude is then modified by 

cr’ +d 
|e + d| 
and the fact that the arguments of G and f, compared to the planar case, are 


12-N 
(—In w)—8 = [-2nir(er' + d)]-* ( ) Sag (ig) 2 


1/2 n/4 
7 


Jr2 w > -—w 
We therefore make a second change of variables 


1/2 n" ! ” 
Zi? = 2! w'/4 aw 


This also generates a factor 2%~! from the dz/z measure and a 4 from dw/w, so now 
the nonorientable diagram looks the same as the planar one except for w © —w in 
G and f, and an extra factor of 2'3. However, the planar graph has an extra group- 
theory factor of N for N “quarks” from tracing over its second, vertex-free boundary, 


while the Mobius strip has an extra group-theory factor, coming from the twist, of 
1 for USp(2N) 


0 for U(N) 

—1 for SO(N) 
This factor is most easily seen from the massless vector (adjoint) propagator: To 
preserve the symmetry of the adjoint representation, the massless vector propagator 
must be symmetric in its 2 group (fundamental) indices for USp, antisymmetric for 
SO, and asymmetric (no twist, but orientable) for U. We thus find the (leading, 
at least,) divergences of these 2 graphs can cancel only for SO(2'%), at least for a 


regularization scheme that respects this symmetric choice of integration variables. 
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5. Closed 


For the closed string we’ll consider just the orientable loop (torus). The one-loop 
amplitudes for the closed string can be obtained by methods similar to those for the 
open string (except for one cheat, which we’ll discuss below). The main modification 
is that we have a constraint to impose, namely AN = Ty — Ty = 0. The easiest way 


to impose this is by including a projection operator in the propagator, 
EGO 4 
I =| eee e 7AN = JAN 
ug oT 


If we combine this with the Schwinger parametrization of the propagator, we have 


in _ [ dr [ do e (THot+io AN) 
Ao 0 —T 20 


We then rewrite the exponent as 


TH + io AN = a Oa + Tp) + io (To = Ty) = plo + plo 


Here Tp is the same as for the open string, but with the replacement p — 3p: 


i= al (Sp) a ena 


(where this N is just for left-handed modes). The net effect is thus to double modes, 
with real p; and T’ becoming complex: 
2 


d 2T = 
dV > S°V(p), dP TH 4 — eTR+TR) 


So the vertices are now anywhere on the strip instead of just the boundary, while the 
sum over states in the trace includes averaging over arbitrary twists. Thus for the 


partition function we have 


1 


wt fol, [Ff (w)P-? = [LF (w) PP 


while the contribution to the volume element from the momentum integral is modified 
by the replacement p — 4p (or equivalently, by the closed string having half the slope 
of the open string): 


TPP = (—In w)"P? — (4Re TYP? = (—Hnful)-?” 


The amplitude is then 


(1) dw 2(2—D);_1 D/2 Ge 1 
- i aad 7 aac y i+ ky Gay 
An | saalteo (—5ln|w)) / (<5) EXP |— |Z kj kyGiy 


t<Jj 
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Unfortunately this procedure has one flaw: The torus is invariant under the group 
SL(2,Z), which divides the naive integration region for JT into an infinite set of copies. 
If we blindly follow the procedure above, the result for the amplitude will be infinite, 
simply because of the over-counting. This error is easy to see from 2D geometry, 
and can be fixed by hand. What is not clear is the error in the derivation, i.e., the 
relationship of the loop to the trees, without which the calculation is meaningless. (If 
the loops don’t follow from the trees, they don’t belong to the same theory, regardless 
of any symmetry arguments. This is exactly the problem of anomalies.) The solution 
should be found from string field theory, and is probably due to our implementing 
AN = 0 in too Abelian a way. 


The “modulus” T is the residual part of the original 2D metric not gauged away by 
the original invariances. Similarly, this SL(2,Z) “modular” invariance is the residual 
discrete part of the original 2D coordinate plus Weyl scale invariance on the torus, 
represented as a field transformation on the residual part of the metric T. (There is 
also continuous translation invariance in the o and 7 directions.) We can describe the 
torus as a parallelogram on the (flat) complex plane, with corners 0, 21, 22, 21 + 22. 
Now consider an arbitrary point z somewhere on the torus: This point is identified 
with the points 


Z=> 2+ nz; 
for integers n’ (i = 1,2). If we now consider SL(2,C) transformations of z; (not the 
conformal SL(2,C), but just transforming z; linearly) 


ee 
R= Gi 2 


we see that to preserve the torus, as defined by its periodicity, 


i Fe r _ 
2321+ =240% =| n° =n g; 


that the matrix g must be integers, i.e., an element of SL(2,Z). We thus have 


a b 
je 
4: (° i. 


for integers a,b,c,d with ad — bc = 1. Actually, the determinant condition is au- 
tomatic: It must be an integer, but the inverse of the group element is also in the 
group, so its determinant is the inverse of an integer, but also an integer. Thus the 
determinant must be 1. (The other possibility of —1 is uninteresting: It is the result 


of combining the SL(2,Z) transformations with a switch of z, with 22.) 


808 XI. STRINGS 


The usual conformal transformations include complex scale transformations, un- 


der which only the ratio 
cal 
T= 
oy) 


which we identify with the modulus, is invariant. Under a modular transformation 


, aTt+b 
i 
ct +d 


Similarly, for any point z, the ratio 


is also conformally scale invariant. It transforms under a modular transformation 


(where z is invariant) as 


P V 


ct +d 


Effectively, we have “gauged” (by conformal transformation) z2 > 1, and 7 and v are 
z, and z in this gauge, so the cr +d denominators in the modular transformations 


are compensating conformal transformations to maintain this gauge. 


First we need to check that the closed-string amplitude is invariant under modular 
transformations. (We already saw that the open-string amplitudes transformed in a 
simple way.) Taking the transformations obtained in subsection XIC3, we need only 


?’s in a few places. We also need the result (the 


modify the results by taking some | 


previous identity for Im 7’) 
In|w| = In|w'||er’ + d|~? 


for SL(2,Z). Remembering that for the closed string the tachyon has k? = 8, so 
a j kit kj = —4N, we find that each piece of the amplitude gives the same multiplied 


by the following exponent for |cr’ + d| after replacing v = v(u’,7’), 7 = T(v’, 7’): 


2 
— —A4, (3 
21 


(—Hnjw|)"8 26, e LFA _, oN 


1 
20 


dw dz 


o\ N-1 
+ -2(N-1), — |wl7|f>“ > —24 
w 


which cancel, proving invariance. 


Next we need to divide up the region of integration for 7 into “fundamental 
regions”: For any such region, any point in the upper-half complex plane can be 


mapped into it in a unique way by a “modular transformation” (which is unfortunately 
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also a “unimodular transformation” due to poor semantics). We started with the 


conditions 
—~71<ImT<7, ReT>0 > —4<Rer <i, Imt>0 


The former condition already takes care of the transformation 7 — 7+1. All modular 
transformations can be obtained from that and tT — —1/7, which takes the inside of 


the unit circle to the outside: Choosing the outside, we get the final conditions 
—3<Rer<5, %Imr>0, |r| >1 


It can then be shown that these choose one fundamental domain. (For example, by 
showing that an arbitrary transformation on the unit circle can produce only vertical 
lines, and circles centered on the real axis whose inverse radius is an integer, where 
all the lines, or circles of the same radius, are related by 7 + 7+.) The extra 


restriction also eliminates the usual UV singularity near T = 0. 


Exercise XIC5.1 


Find several of the other fundamental regions. 


Divergences in the closed-string loop are similar to those in the open string, except 
that now the divergence from the loop integral associated with a closed string going 
into the vacuum has the torus on the end of that tadpole. Since restriction to the 
fundamental region has already eliminated the UV divergence, this divergence now 
shows up only as an IR divergence from all the v’s near 0, i.e., factoring the graph 
as a closed-string tree times the tadpole. (For the open string we had to perform 
a modular transformation to change the UV divergence into closed-string IR, since 
taking v’s to 0 shows only the usual tree divergences.) Unlike the closed tree graph, 
where 3 of the vertices had fixed positions, so no more than N — 2 vertices could 
converge (which is symmetric with the fact that no less than 2 can), for the loop only 
1 vertex is fixed, so all N can converge (and there is no symmetry n < N —n because 


the loop itself is associated with vertices that don’t converge). The calculation is 
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the same as for the open string, except now we scale |v|? instead of just v, and the 
poles are closed-string instead of open. The divergence is again from the tadpole 
propagator, and its interpretation is as for the nonplanar graph, coming from the 


tachyon and dilaton, but with different coefficients. 


Note that the dilaton that appears here as a 1-loop correction (at least for the 
closed-string case) is really the determinant of the metric, in the usual string gauge 
for (26D) local Weyl symmetry: Since the vacuum value of the true dilaton ¢, defined 
as the field that couples to the ghosts (or, at order a’, to the worldsheet curvature, 
and thus the Euler number, which counts loops) and not to X, generates the string 
coupling through its classical vacuum value, it appears in the effective action along 
with h, homogeneously as (¢7/h)!~“, and thus not at all at 1 loop (except through 
derivatives, as 0¢/¢@, etc.). Thus, the 1-loop term that couples to the determinant (or 
trace, at linearized order) of the metric is actually the cosmological term. (Of course, 
if field redefinitions of the metric are made to get the usual classical R term, this will 


generate @ dependence, and ¢ will no longer count loops.) 


6. Super 


The calculation for 1-loop open superstring amplitudes with 4 or less external 
vectors is very similar to the superparticle case (see subsection VIIIC4), combined 
with the results for the open bosonic string. To begin, the vertex operator for the 


vector takes the same form as for the particle (see subsection VIIIC5): 
V= Kiel XX, =k Sop) 


where the spin operator S,») is now represented by an appropriate current (evaluated 
at the same z as X ) expressed in terms of worldsheet fermions (see subsection XIB5). 
The simplest way to do these calculations (so far) is in the lightcone. For 4 or 
fewer external lines we can choose the < 4 external momenta (< 3 independent) and 
< 4 external polarizations to all point in just the transverse directions, avoiding the 
complications of nonlinearities in the longitudinal components. Then the tranverse 


part SO(8) of the spin current is simply (in somebody’s normalization) 
Sig = Beda = gS IS 
in terms of the vector fermions w of Ramond-Neveu-Schwarz and the spinor ones S 


of Green-Schwarz, related by triality and bosonization. 


In the RNS case we still have to consider summing over R and NS strings, so GS 


is simpler. Among the various parts of the trace in evaluating the loop, we have in 
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particular the (super)trace over the zero-modes of S. In fact, this is the same trace 
considered for the particle in subsection VIIIC4: It is the trace over all the massless 
states created by those zero-modes. (As usual, the trace factorizes into that trace 
times those for the oscillators that create massive states.) If we sum over those states 
individually, we are performing (the lightcone version of) the same analysis we made 
for the particle. But we can also make the analysis for all states together, by making 
the same analysis as for a superparticle in the lightcone: For example, note that 
the S’s form an SO(8) Clifford algebra — they are the usual 7 matrices with indices 
switched by triality. Then we see that the supertrace is just the usual trace with an 
extra factor of “y_,” (since S takes the bosonic vector to the fermionic spinor and 
vice versa). So the first nonvanishing supertrace is that of 8 S’s (giving an € tensor 
in its spinor indices), ie., 4 S;;’s. The result will then have the same structure as 
the particle case: 4 F’s times the loop integral (less the fermionic zero-modes) for 4 
scalar vertices (tachyon-like, but massless). The explicit form of the F“ factor is the 
same as for the particle, and also the same as for the superstring tree. (Again the 
tree calculation of this factor is more complicated. For the bosonic string even the 


tree factor itself is more complicated, containing a’ corrections.) 


Exercise XIC6.1 

Repeat the calculation of exercise VIIIC4.1 of the zero-mode supertraces, 
again using components (i.e., evaluating the spinor and vector contributions 
separately), but now using just physical components, corresponding to the 
SO(8) transverse spin operators. Compare to the SO(10) calculation reduced 
to a lightcone gauge. For SO(8) there is also an € tensor contribution (re- 
member the spinor is Weyl), not appearing in SO(10): What happens to 
it? 

Now that we have reduced the loop to a “kinematic” factor for the polarizations, 
identical to that in the particle case, times a bosonic-string-like expression, we can 
do the rest of the calculation in analogy to the bosonic case. The Green function 
is the same as in the bosonic case, since we now have to deal with just X in the 
vertex. The partition function is actually simpler: For the oscillators, we now have 8 
bosons and 8 fermions with the same boundary conditions, so their supertrace is just 
1. (Remember that ghost fermions gave the inverse of X bosons.) For the remaining, 
bosonic zero-modes, the only difference is that the ground state is massless, so we 


don’t get the extra w~t. Thus the volume element is simply 


V=(-Inw)”? 
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coming from the momentum integration, now in D = 10. In summary, the result for 
the 4-point amplitude is the same as the bosonic tachyon amplitude (see subsection 
XIC1), except for: 


) the F* factor, 


1 
2) no power of f(w), 


( 

( 

(3) w? > we, 
(4) (—In w) appears to the power —5, and 
(5 


) external lines are massless. 
This analysis applies to planar, nonplanar, and nonorientable loops. 


In the net transformation given at the end of subsection XIC3, used for changes 
of variables in analyzing divergences, for the 4-point superstring amplitude we now 
have instead 

(—In w)~° = [-2mit(er’ + d)]-° = i(2n)* 


The main difference in the analysis of divergences is that the lack of the w~!f~*4 
factor (before and after changes of variables) removes the divergences associated with 
the now-absent tachyon (see the argument at the end of subsection XIC4), so all 
open-string divergences cancel for gauge group SO(2?/*) = SO(32) now. (There is a 
2N+1 — 2° in the final change of variables for the nonorientable loop.) 

The missing factor also means that now closed-string poles start with massless 
ones. The F factors (of the form |tr(F?)]? for the nonplanar loop) now give spin to 
these poles, and for the massless poles the singlet currents tr(F’F’) can be associated 
with the graviton, dilaton, and axion (4-form) coupling, as for the particle. The 
latter couples to the Hodge dual of the dual of the 2-form, i.e., an € tensor times a 


(D — 4)-form B: in form notation, 
L,~ BAtr(F AF) 


(In D = 4, B would be the usual pseudoscalar axion.) This preserves the Abelian 
gauge invariance of the (D —4)-form, since tr(F’ A F) is the curl of the Chern-Simons 


form (see subsection IIIC6): 
ir(FAF)=dC, C=tr(gAAdA+ fAAAAA) 


We can see how this relates to the usual 2-form B by a duality transformation: 
Starting with a first-order form of the (D — 4)-form action (ignoring the dilaton or 
string coupling), 

Ip ~ GAdB — 1G 
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and varying with respect to B in Ly + Ly, we find 
Lo + Ly > $G”’, G=dB+C 


Gauge invariance of G under Yang-Mills transformation implies an unusual transfor- 
mation for B: 
dC ~tr(dA\dA) => d6Bxtr(AAdA) 


(This is the same coupling and duality considered in D = 4 in exercise XA3.1, and 
supersymmetrized in exercise XB5.1.) 


Remarks similar to those about the open string apply to the closed-string am- 
plitude (torus), where everything is replaced with | |?’s again, but now including 
left and right-handed copies of the kinematic factor, with products of left and right- 
handed vector polarizations giving those of the massless states of the closed string. 
The proof of modular invariance works as before, but with the changes listed above 
for the open superstring. (For Type II strings one has only the torus, but Type I 
is nonorientable, and so has also the other graphs mentioned earlier. Here we don’t 


consider the heterotic string.) 


There are several ways to see that this amplitude has no divergences: (1) We 
already saw previously that such a divergence would be associated with a cosmological 
term. This term, if expanded about flat space, would give a 4-point interaction 
with no derivatives. But we have already extracted a kinematic factor that contains 
derivatives. Furthermore, it would contribute to lower-point functions, which we 
saw vanish. (2) Non-renormalization theorems in supersymmetric theories prevent 
generation of a cosmological term in loops. (3) By duality, the limit where some 
of the p’s get close would reveal this divergence as an intermediate dilaton state 
connecting a tree with a lower-point loop. But we have already seen the lower-point 


loop graphs vanish. 


Let’s look at the last argument in more detail: Applying this method in the same 


way as for open strings in section XIC4 and trees in subsection XIB6, we find 
dvs [ ae pr-it3 > kjk; 
0 


(1<n< N—1) where again n+ 1 p’s are taken close together, and the sum is over 
i < Jj for those n+ 1 k’s, but now we get an exponent 2n because open-string p’s are 


replaced with |p|?’s, and now 
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where M? = —8 for the closed bosonic-string tachyons, but 0 for closed superstring 
ground states. (As before, the T integration converges because the fundamental 


region restricts Re T > V3m.) So for the bosonic case we get 


1 


A osonic ™ d i ae Oe aes 
: i aa s+8 


giving the usual closed-string tachyon pole (or divergence for no s). However, the 


4-point superstring amplitude is convergent (at s = 0) for all n: 


A sper - | dp le fa for n=1 
0 


S — 

(i.e., 2 k’s). As expected, there are no tachyons, and there is no contribution from 
massless poles because the 3-point loop that has been factored out vanishes for ex- 
ternal massless states. (Similar remarks apply for such singularities in the open- 


superstring loop.) 


7. Anomalies 


There are no anomalies in odd dimensions, especially D = 11. Since string 
theories are equivalent to D = 11 M-theory, they have no anomalies. (This is similar 
to understanding cancellation of anomalies in the Standard Model by embedding it 
in a manifestly anomaly-free GUT.) However, the manner of anomaly cancellation is 


unusual, and may have application beyond the present string theories. 


Axial anomalies arise from massless (chiral) particles inside the loop. Thus we will 
first analyze anomalies in such field theories, then consider how these fields appear in 
string theory. The relevant field theories are 10D supersymmetric Yang-Mills coupled 
to N=1 supergravity, or N=(2,0) supergravity. 


In twice-odd dimensions (in particular D=10), unlike twice-even (like D=4), ir- 
reducible spinors are truly chiral (Weyl is not the same as Majorana): In D=4, 
Yang-Mills can couple chirally because a Weyl spinor and its complex conjugate may 
be in different representations (i.e., complex), but in D=10 even a real group rep- 
resentation can give an anomaly. Similar remarks apply to gravity: In D=10 even 
gravity can couple chirally, because a chiral spinor need not be accompanied by its 
opposite chirality to construct an action. Similar remarks apply to selfdual tensors 
(which also exist only in twice-odd dimensions), which can be generated from prod- 
ucts of selfdual spinors. Mass terms break chiral invariance, since they couple fields 
of opposite chirality, so only massless fields running around the loop contribute to 


the anomaly. 
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Exercise XIC7.1 
Using the general results for irreducible spinors of subsection XC2, what are 
the simplest spinor actions for all dimensions and all signatures? (Hint: There 
is a simple correspondence with something in the big table.) How many 
irreducible spinors are required to write an action in each case? What happens 


when masses are introduced? 


For the spinor in external Yang-Mills this calculation is standard (see subsections 
VIIIB2-3). We are left to consider the group theory. To mimic string theory, we use 
the ’t Hooft double-line notation for the adjoint representation in terms of indices of 
the defining representation, as implied by the quark interpretation for Chan-Paton 
factors. Thus each “planar” propagator has added to it a twisted propagator, with 
the factor 0 or +1 as described in subsection XIC4. (Equivalently, we could twist 
the vertices instead.) In the planar and nonorientable (with respect to this notation) 
graphs, we get for the anomaly F?/? without traces: For the planar case all vertices 
are on one side, and there is a factor of N from tracing the other side, while for the 
nonorientable case there is only one side. However, there is only 1 planar graph, 
while for the nonorientable case there are 2?/? (i.e., exactly half of all the graphs 
from twisting any of the D/2 + 1 propagators). Thus this particular contribution to 


the anomaly is canceled for SO(2?/*). 


For the nonplanar graphs some F’ vertices are on a different side than the y_1 
one, so there is a trace of those F’s. For the case D = 10, we already saw in 
subsection VIHIC4 that (using Bose symmetry to get an anticommutator) tr(F°) 
vanishes for SO(32). Of course tr(F) also vanishes, and for the same reason tr(F’) 
can’t contribute because the anomaly itself is adjoint. Finally, by adding a local 
counterterm to the action of the form tr(AF)tr(AF?) we can always convert the 
anomaly between tr(F?)F? and tr(F*)F. (At this lowest order in the fields we can 
use the Abelian part of F' and of the gauge transformation.) So for SO(32) in D = 10 


we can cancel all of the anomaly except a term tr(F“)F. 


At this point we remember that the nonplanar open superstring loop generates 
an unusual coupling in the 4-point amplitude between Yang-Mills and the axion (see 
subsection XIC6). Using this fact, we can write a local counterterm that cancels the 


anomaly (at least at this order), namely 


BAtr(FAFAFAF) 


Similar remarks apply to gravity anomalies, and mixed Yang-Mills/gravity anom- 


alies. Again the Yang-Mills generators are replaced with Lorentz generators. But for 
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pure gravity there are no Chan-Paton factors, so cancellations must be between dif- 
ferent spins: spin 1/2, gravitino, and self-dual tensor. It turns out that cancellations 
can be obtained in D=10 only for N=(2,0) supergravity or N=1 supergravity coupled 


to SO(32) or Eg@Eg Yang-Mills (or some uninteresting non-semisimple groups). 


Superstring theories generally violate (spacetime) parity because they use chiral 
spinors; the only exception is Type IIA, because the left-handed and right-handed 
modes have spinors of opposite parity, so spacetime parity includes worldsheet parity 


(switching left and right). 


The calculation of the open-superstring anomaly is simpler in the covariant RNS 
formalism than in the lightcone. It is very similar to the particle calculation of 
subsection VIIIB2, replacing Y with [do w-0X, Sa, with S Vay), y_1 with its GSO 
analog, etc., since the background appears via OX — OX(c) + A(x)d(o), the d(c) 
putting A on a boundary. Integrating out the fermionic zero-modes gives the same 
kinetic factor, again leaving m? times the graph with external massless scalars, the 


internal mass? shifted by m?, and no fermionic zero-modes. 


Unfortunately, now that scalar graph is nontrivial, since there are always states 
with mass much greater than m, no matter how big m gets. This is related to the 
fact that this 6-point graph can be factored into a 4-point loop times a 4-point tree 
(or 5-point loop times 3-point tree) in an appropriate limit, while in the particle case 
we needed to consider only a 1PI graph. The evaluation of this graph is similar to 
the one-loop superstring in the Green-Schwarz lightcone: The factors of f(w) again 
cancel, 10 being reduced to 8 for both the X and w oscillators by the ghosts. 


However, because of the shift in mass’, there is now an extra factor of wm /2 (ie., 
the ground-state mass is now m). As a result, the limit m? — oo is dominated by the 
region w © 1 (T_ = 0). Therefore, we again transform coordinates, and again find the 
largest contribution comes from the (shifted) dilaton, whose propagator now gives a 
factor 1/m? going into the vacuum. (All other contributions die more rapidly, and 
vanish as m — oo even after multiplying by the overall m?.) Again as for the 4-point 
amplitude, this dilaton contribution cancels between the planar and nonorientable 
graphs for SO(32). However, for the nonplanar graph, this dilaton contribution van- 


ishes, again as for the 4-point; we have seen the explanation in subsection XIC4. 
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XII. MECHANICS 


String theories describe particles of arbitrarily large spins: So far in this text we 
have concentrated on lower spins, but we can describe (at least) free gauge-invariant 


actions for arbitrary spins based on quantum mechanical BRST. 


Gauge invariance is required in field theory to manifest Lorentz invariance. The 
basic problem is that a four-vector wave function cannot have the obvious Minkowski 
inner product, since the time component would have a minus sign in its normalization, 
resulting in negative probability. In the classical action there is a gauge invariance 
that allows the time component to be dropped from the action. However, such gauges 
destroy manifest Lorentz invariance, since a three-vector cannot represent Lorentz 
transformations in a local way. More useful gauges keep all components of the four- 
vector, while also introducing scalar fermionic “ghosts” to cancel the effects of the 
bad part of the four-vector. A certain symmetry between the bosonic and fermionic 
unphysical degrees of freedom is needed to enforce this cancellation: It is the field 


theoretic version of the BRST symmetry discussed in section VIA. 


Another complication is that gauge transformations do not allow the elimination 
of traces in a simple way: Although it is Lorentz covariant to constrain a tensor 
to vanish when a pair of its vector indices is contracted, this interferes with gauge 
invariance in interacting theories, such as gravity. A related complication is massive 
theories, which can’t always be described simply by adding mass terms to massless 


theories. 


There is a simple solution to all these problems, which determines the free part 
of the action for any theory. (Interactions are a separate problem.) This method 
automatically introduces all the correct fields, including ghosts, for any massless or 
massive theory. It also gives a simple universal expression for the BRST symmetry 
that cancels unphysical modes, as well as providing a simple proof that these modes 
disappear in the lightcone gauge. The method is based on the idea of introducing 
extra fermionic dimensions to spacetime that are unphysical (unlike superspace for 
supersymmetry), which cancel unphysical degrees of freedom associated with the time 


dimension. 


Although for most purposes the only spins of fundamental particles relevant in 
field theory for are 0, 1/2, 1, 3/2 (maybe), and 2, and these few cases can be studied 


separately, in this chapter we’ll analyze all free theories because: 


(1) The ultimate theory of particles may require them; 
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(2) some of the theories presently under most active investigation (such as strings 


and membranes) require them; 
(3) many observed, though perhaps not fundamental, particles have higher spin; and 


(4) a better understanding of field theory can be obtained by determining exactly 


which properties all fields have in common as well as how they differ. 
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This construction involves the introduction of spacetime symmetries that are not 
manifest on the physical coordinates. An important analog is the conformal group in 
D dimensions, which acts nonlinearly on the usual D spacetime coordinates, but can 
be represented linearly on D+2 coordinates, since the group is SO(D,2). As described 
in subsection IA6 for spin 0, 1 space and 1 time coordinate can be eliminated, so that 
SO(D,2) is still represented, but SO(D—1,1) is the largest orthogonal group that is 
still manifest. We have also seen that in the lightcone gauge this manifest symmetry 
is reduced again in the same way, leaving SO(D—2). In our case the relevant group is 
OSp(D,2|2), the natural generalization of the orthogonal group to D space, 2 time, and 
2 anticommuting dimensions. This allows rotations between timelike and fermionic 


directions, eventually resulting in their cancellation. 


1. Lightcone 


We saw in subsection IIB3 how the single equation of motion S,°0, + wd, = 0, 
applied to field strengths, universally described all spins in all dimensions, for free, 
massless particles. (A possible exception is the spinless case, where we need HI = 0, 
which is redundant otherwise. However, we can use the universal field equation even 
in that case if we use the vector field strength formulation of spin 0.) One way we 
solved this equation was to perform a unitary transformation. We can use the same 
unitary transformation, plus the constraints, to simplify the Lorentz generators. To 
further simplify matters, we can use the constraint LJ = 0, solved for O~, to choose 
the gauge at = 0, which is equivalent to working in the Schrédinger picture (no 
time dependence for operators). The procedure is thus: (1) Start with the manifest 


(antihermitian) representation of the Lorentz generators, 
je = Faurala 4 gb 


(2) Apply the transformation 


fav sy InU= a 
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which eliminates the only S** term, in J** (while complicating J~’). (3) Finally, apply 


the constraints, which have already been transformed by this same transformation, 
O=0 (> gauge x* = 0), s”a, + wd* =0—- S*- -w=S* =0 
Our Lorentz generators are then 
[Sag 0": Jt =-2°0t+u, J = gal + gu 
J = -2- G+ = [12i(#)? + SY0; + wd" 


Oot 


These generators satisfy the pseudo(anti)hermiticity condition 
Jot (oy = —(0*) 1—2w gab 


This means that the Hilbert-space metric needs a factor of (0+)!~2”. This is related 
to the fact that the w terms can be eliminated by a nonunitary transformation with 
the appropriate power of OT: As part of step 2, we could have applied a second 


transformation 


Us = (ate 


with the result of eliminating all St~ terms, so the redundant constraint St~ = w 
would not have been needed, so w would not appear. Uy is in fact just the trans- 
formation that takes the surviving independent part of the field strength F***- to 
the lightcone gauge field A’-’, taking us from the original constrained field strengths 


to unconstrained gauge fields. In any case, we generally choose w = 0 for bosons. 


We previously applied dimensional reduction to the field equations for the field 
strengths, to obtain the equations for the massive, free theories from the massless 
ones. The same methods can be applied to the Lorentz (or Poincaré) generators. 
(The Lorentz generators will be used later to find the BRST operator, to obtain the 
field equations in terms of the gauge fields, and the action. For that purpose the 
dimensional reduction can be performed at any stage in the derivation.) We thus find 


the general result 
pte _zift , JS 0's w, Ji = lei ch. ia 


J- =-2°0' + AE 5x"[(07)? — m7] + 840; + S* tim + wd'} 


(The —1 is still an index, and should not be confused with an inverse.) 


Note that S~* and S~* were eliminated (after the unitary transformation) by the 


constraints, and that St* just dropped out. (In other words, S** = 0 was the gauge 
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choice for the constraint S~' = 0.) This leaves only S” (and S*~! in the massive case), 
whose representation is that of the highest-weight part of the original field strength. 
However, we can more simply choose the representation of S” as our starting point, 
since it is just the transverse part of the gauge field; it defines the representation of the 
Poincaré group. We therefore have an explicit construction of the generators of the 
Poincaré group, for arbitrary representations, defined on just the physical degrees of 
freedom, given directly by the little group SO(D—2) spin generators S$”? (or SO(D-—1) 
generators S”7 and S‘~! in the massive case) that identify the representation. For 
example, in D=4, SO(2) has just one generator, the helicity, so for any state of a 


given helicity we know the action of the Poincaré generators. 


Exercise XITA1.1 
Check that this lightcone representation of the Lorentz generators satisfies 


the correct commutation relations. 


Classical free field theory is easy to define in the lightcone, since solving the 
constraints in the lightcone formalism has picked out just the physical components, 
so the only remaining constraint is the Klein-Gordon equation. Thus, the kinetic term 
for any massless bosonic field is simply —3¢($0)¢, where $0 = —0*0~ + $(0')?, 
and 0” is considered the time derivative. (In general, the kinetic operator for a 
massless boson is some second-order differential operator, which reduces to 0) on the 
physical components.) For fermions we have instead 11/07, since we must then have 
an odd number of derivatives to avoid getting a trivial result after integration by 
parts. (For a boson, ¢0¢ = 0(4¢7), for a fermion PIIW = O(WOW) — (OW)? = O(Wdw) 
by anticommutativity. In general, the kinetic operator for a massless fermion is some 
first-order differential operator, which reduces to 11/0* after eliminating auxiliary 
fields.) 


This quantum mechanical representation of the Lorentz generators has a simple 
translation into classical field theory, in terms of field theory Poisson brackets. The 
definition of Poisson brackets in lightcone quantum field theory follows directly from 
the action: Defining as usual the canonical momentum 7 as (minus, in our conven- 
tions) the variation of the Lagrangian with respect to the time derivative 0, (= —O7) 
of the variable ¢, we find the fundamental bracket for bosons 


cao) = Berea wie in (On) P60 — gi-)6P-2(g8 — a) 


(Note that the 0* was essential for the antisymmetry of the bracket. We can also 


evaluate its inverse as an integral, so 7-d(a~ — a’) = 5e(a~ — a2’). For fermions 


we have instead an anticommutator and no 1/0*.) We then find that any quantum 
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mechanical group generator J (including internal symmetries) can be represented in 


field theoretic form as 
dx~ d?-2z" 
G2 oo aye $0" 10 = iH(6lJ0) 


where we have used the relativistic inner product of subsection VB2, but for a lightlike 


hypersurface: For positive-energy solutions 
ame hada wo 
(l2) = f era DOs 


Note that the free Poincaré generators are local in this form, from cancellation of 
O*’s. In interacting theories, the generator J~', as well as the translation generator 
P-, which is also the Hamiltonian, gets additional terms higher-order in the fields. 
In this manner, relativistic quantum field theory can be quantized in a way that 
more resembles nonrelativistic field theory than in non-lightcone methods, since 0 
is quadratic in the usual time derivative 09. We won’t consider lightcone quantum 
field theory further; however, in the following sections we’ll use this construction to 
derive free gauge theory and its covariant quantization, in a way that we’ll generalize 
straightforwardly to interactions. Thus, the same construction directly gives the 
formulation of free representations of the Poincaré group, from field strengths to 


transverse fields to covariant gauge fields. 


2. Algebra 


From the definition of the graded determinant in terms of Gaussian integrals (see 
subsection IIC3), we see that anticommuting coordinates act like negative dimen- 
sions: For example, sdet(kI) = k*° for a commuting and b anticommuting dimen- 
sions. Thus, if we add equal numbers of commuting and anticommuting dimensions, 
they effectively cancel. Here we’ll do the same for theories with spin, which allows 
the restoration of manifest Lorentz covariance to lightcone theories: Adding 2 com- 
muting and 2 anticommuting dimensions to SO(D—2) gives OSp(D—1,1]2) (see also 
subsection IIC3), which has an SO(D—1,1) subgroup. 


We have seen that quantum field theory requires unphysical anticommuting fields 
to cancel the commuting unphysical fields introduced by using gauges that do not 
eliminate longitudinal polarizations. For example, the gauge field for electromag- 
netism has only D—2 components in the lightcone gauge, but needs to keep all D 
components to maintain manifest Lorentz covariance; this requires 2 “ghosts” to can- 


cel the 2 extra components of the gauge field. The general result, at least for bosonic 
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gauge fields, is to produce fields that form representations of OSp(D—1,1|2), includ- 
ing gauge fields and ghosts. Furthermore, by adding 2 anticommuting dimensions the 
BRST transformations that relate the ghosts to the longitudinal degrees of freedom 
can be introduced in a natural way, as translations in the new coordinates. The result 
is that OSp(D—1,1]2) multiplets are automatic, and gauge fixing receives a geometric 
interpretation. In this section we’ll see that an even more natural interpretation of 
these BRST transformations is as rotations of the anticommuting coordinates, and 
that they not only make gauge fixing to the simplest Lorentz covariant gauge triv- 
ial, but also give a simple derivation of the gauge invariant action itself. (The two 
points of view are related in that translations can be considered as part of “conformal 


rotations”, as we saw in subsection IA6.) 


The basic idea is very simple: Take the lightcone representation of the Poincaré 
generators, found in the previous subsection, and extend the SO(D—2) indices and 
representations to OSp(D—1,1|2) ones (including appropriate signs for the grading). 
Conversely, we can begin the construction with the “conformal” group OSp(D+1,3]2), 
find the equations of motion for the “Poincaré” group OSp(D,2|2), and solve them 
for the “lightcone group” OSp(D—1,1|2). So we can use the same expressions for the 


generators, but now the “transverse” OSp(D—1,1|2) index is 
1 = (Ga) 


where a is a D-component index of SO(D—1,1) and a is a 2-component index of Sp(2). 
The OSp(D—1,1|2) metric is 
ni = ig? Cm) 
Furthermore, we divide up the full OSp(D,2|2) index as 
(4,1) =(A,a), A=(4,a) 


We now interpret the SO(D—1,1) subgroup that acts on the a index as the usual 
physical one, since the generators take the usual covariant form (because all the 
transverse generators are linear). The orthogonal subgroup OSp(1,1|2) that acts 
on the A index, and leaves the a index alone, is then interpreted as a symmetry 
group of the unphysical degrees of freedom, an extension of BRST (and “antiBRST”). 
(Note that OSp(1,1]2) is the group of coordinate transformations in 2 anticommuting 
dimensions; for later reference, IGL(1) is the same for 1 such dimension). However, the 
generators with — indices are nonlinear, since the + indices are no longer independent 
from the rest. (The + were gauged away, the — were fixed by equations of motion.) 


As a result, they act on transverse indices in a nontrivial way. 
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= A: ghost OSp(1,1]2) 


longitudinal, nonlinear SO(1,1): ate \ 
transverse, manifest OSp(D—1,1|2): i = { 


: Lorentz SO(D—1,1) 


Since the OSp(1,1]2) generators act only in the unphysical directions, all physical 
states should be singlets (with respect to the cohomology) under this symmetry. 
This is clear from the original construction: We started with linear generators for 
OSp(D,2|2) (and translations and dilatations for (D,2|2) dimensions), applied the 
equations of motion in terms of them, and now we apply the OSp(1,1|2) singlet 
condition last. If we had instead applied the OSp(1,1|2) singlet condition first to the 
(D,2|2) dimensional space, we would have gotten the usual (D—1,1|0) dimensional 
space, and finally applying the equations of motion would have given us the lightcone 


results of subsection IIB3. 


OSp(D,2|2) 
manifest symmetry: ra Y 
SO(D-1,1) == OSp(D—1,1]2) 
\, field equations (fix +) field strengths gauge fields/BRST 
/ BRST singlets (fix A) Ay fe 
/ add 242 (extend i — (a, a)) SO(D—2) 
lightcone 


Explicitly, the OSp(1,1|2) generators are (choosing w = 0) 
jte = —2°9t, Jot = x Or, Jw = AP) 4 ge 
1 
JO =x 0% + prise" (O —m + 0°0,) +S”; + S* im + Sp] 


while the SO(D—1,1) generators take their usual manifest form 
je _ Pautala te gb 


Exercise XITA2.1 
Write the general commutation relations of OSp(D—1,1|2). Specialize to the 
case OSp(1,1|2), in lightcone notation. Show that this representation satisfies 
them, paying special attention to signs. (Use the OSp(D—1,1|2) commutators 
for the S’s.) 
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We can also add a “nonminimal” part to the general “minimal” part of the 
OSp(1,1|2) algebra we have already derived, in the sense that the two parts com- 


mute and separately satisfy the commutation relations: 
JAB _, JAB 4 GAB 


(This is similar to adding spin pieces to orbital in the absence of constraints relating 
them.) The simplest choices are to choose this new part to be just quadratic in new, 
“nonminimal” coordinates and momenta. It will prove convenient to perform some 
transformations J + UJU~' that make the OSp(1,1|2) generators more similar to 
what they were before adding the spin parts. We therefore make two consecutive 


transformations: 
J UU, JUL Us): Uy =e **0/8* UY = (8,9 


to return J~* and J*® to their previous forms. In fact, these are just the OSp(1,1|2) 
version of the same transformations we used in the previous subsection to remove S** 
and S*~. The result is 


Jesse, Jd ac 0, UP autora se 
i és 
J* =—x 0% + pe le(-K = 50°03) + Oh + 5° Ag] 


Here we have 


K=-3(O-m’), $%= 597+ 99°, Q%> = 9° 45%, + 8° tim — St°K 


but more generally we can satisfy the commutation relations by requiring only that 
K, Q,, and Sap are independent of the unphysical coordinates x~ and x® and their 


momenta, and satisfy that their only nontrivial commutators are 


{OO SIs’, sO ,=00O™, (54,5) a0 7s." 


U2 is nonunitary, which makes S*¢ hermitian (rather than antihermitian) after 
the transformation, requiring a modification of the usual representation for S. The 


usual representation can also be used by introducing an 7 into the transformation, 


which gives S*° a factor of +i in Q°. However, this i can be removed by the same 
method used in subsection IIB4 to remove 2’s associated with the index —1, only now 
it is applied to both the + and — indices. 
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3. Action 


We saw in the previous subsection that physical states are singlets under the 
OSp(1,1|2) BRST symmetry. It was introduced in a trivial way, but became nontrivial 
after solving the equations of motion; on the other hand, applying the singlet condition 
first reproduced the usual lightcone analysis. Reversing the order of applying the two 
conditions has the advantage of allowing the physical state condition to be expressed 


as a single equation, which can be derived from an action. 


There are two ways of doing this: One is to use this algebra to generalize to gauge 
fields the first-quantized BRST as applied to field theory in subsection VIA3. Because 
of their quantum mechanical origin, the gauge-invariant Q@ actions directly give a 
form suitable for choosing the Fermi-Feynman gauge, where the kinetic operator is 
simply O—m?. However, this is somewhat unusual for fermions, whose simplest: field 
equation is first-order. (But it is useful for supersymmetry, where bosons and fermions 
are treated symmetrically.) As a result, this approach gives actions for fermions with 
an infinite number of auxiliary and ghost fields. The most convenient way to discover 
the usual finite-component gauge-invariant first-order actions hidden there is by per- 
forming an appropriate unitary transformation, after which this action (for bosons or 
fermions) appears as the sum of three terms: the usual gauge-invariant action, a term 
giving the usual second-quantized BRST transformations, and a nonderivative term 
that would be considered nonminimal under second-quantized BRST. This approach 


will be described in detail in the following sections. 


The other way is to define a 6 function in the generators of the group OSp(1,1]2), 


and use it as the kinetic operator for the action: 
ss - f ae dx d’x* 468*8(Jap)® 


where the integration is over all the coordinates appearing in the OSp(1,1|2) gener- 
ators. (The dz part is the usual d?x/(27)?/?.) 0+ comes from the usual relativistic 
inner product; it is also a “measure” factor, which is a consequence of our using 


generators satisfying the pseudohermiticity condition 
J‘ 4p0* = —O* Jap 


Equivalently, we could redefine  — (0+)-/?6, J > (d*)-¥/?2 (A+)? (assuming 
o* # 0, as usual in lightcone formalisms) to eliminate it and restore hermiticity. 
(This would only affect J>* + J~+ — 3, J0- — J + 0°/20*, making hermitian 
the terms ${2~,0T} and +{a*, 0°03}/0*.) Because of the 6 function, this action has 
the gauge invariance 

6B = 1 JPA Age 
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Thus, the field equations and gauge invariance reduce & to states in the OSp(1,1|2) 


cohomology. 
More explicitly, the 6 function can be written as 
a é(Jap) = 0" (dae OT *) or ™) or) 
= §(2~)5?(x%)5(Sqg2)O? I~? 
where we have used 


Poets Ie Ss AO Sa 


(There is freedom in ordering of the original 6 functions: Reordering of any two 6’s 
produces terms that are killed by the other 6’s.) The 6(S.g”) can be interpreted as a 
Kronecker 6,9 in the Sp(2) “spin” s: 


=40' Sg = A4s(s + i) 


(remember S$“? is antihermitian, and iS®° is always integer while s can be half- 
integer). The rest of the explicit 6’s are Dirac 6’s in the unphysical coordinates, 


which can therefore be trivially integrated out, leaving: 


ss fu Lig, Lyi = 50K gid, 3(-O + m? + $9°Q,) 
where ¢ is @ evaluated at x° = x = s = 0. Furthermore, the remaining gauge 
invariance is 
do = 605 O° Aa 
from J°-, since J°%, J*+*, and J~+ have been used to gauge to s = a* = x = 0, 


respectively. 


Exercise XITA3.1 
Show explicitly that this action is invariant under the OSp(1,1|2) gauge trans- 


formations. (Hint: Use the same method as exercise XA2.1.) 


4. Spinors 


As we saw in subsection VIA3, the BRST algebra for the (Dirac) spinor requires 
nonminimal terms. For the general case of fermions we add these terms in the general 
way described in subsection XIIA2, choosing them in terms of a (second) set of 
OSp(1,1|2) y matrices: 


GAB 1A a BY {aA 5B) = 48 


828 XII. MECHANICS 


where 74 = (—k, —ps; E, ic) in the notation of subsection VIA3. In particular, we find 
Go8 — geek _ La(anB) 
O° = 8A, + Se tim + 7° [7 + 974(0 — m’?)| 


The next step for general massless fermions (and similarly for the massive case) 
is to apply 
Pe oe Ss OOH =P Ra Opa ek 
where we have used 
V¥a = CSM, Fal = 1 
At this point we note that the gauge invariance generated by Q°, for gauge parameter 


Aq = JaA, includes a term 7 A that allows us to choose the gauge 
Ve =0 


One way to think of this is to treat Yt as an anticommuting coordinate and 7~ as 
its derivative; another way is to treat them as 2x2 matrices. Alternatively, we can 
unitarily transform the action to contain just the y~ term of the operator: From the 
discussion of 2D y matrices of subsection VIIB5 we find, including that part of the 
spinor metric, 

TH = —(o0) 
then acts as a projection operator. Either way, the net result is to reduce the action 


to, now restoring the mass, 


Ss = pu Ligi F Lge — 1bK git, Kot = 5° (Sa°Oa st Sa—11M) 


where ¢ is with the 4+-dependence eliminated (the top component in the above 
matrix representation). Thus, db differs from the bosonic case in that it not only 
depends on x° and is a representation of S”, but is also a representation of 7%, which 


appears in 5° to define s = 0. 


The only type of representation we have missed in this analysis is self-dual anti- 


symmetric tensors. In terms of field strengths, these satisfy 


F pros--bo/2 


Q@1-.-AD/2 


_ 1 
= + HH Ear...apabr--bp 2 


which is consistent, with Lorentz metric, if D/2 is odd (as seen from applying the 
€ tensor twice). A similar condition holds for the gauge field in the lightcone gauge 


(with a (D—2)-dimensional € tensor). Because of the € tensor, this condition can’t be 
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described by adding extra dimensions to the lightcone. However, the direct product of 
two spinors contains all antisymmetric tensors, and the rank D/2 one can be picked 
out by an appropriate OSp invariant constraint. The self-dual part of this tensor 


comes from the direct product of chiral spinors. 


Exercise XITA4.1 


We now consider this construction in more detail: 


a Derive the generalization of y_; to OSp y matrices, anticommuting with both 
the fermionic and bosonic y’s, {y-1, ya} = 0. We can use the usual product 
for the fermionic y’s, but obviously the bosonic ones will need something 
different. (Hint: For each pair of fermionic or bosonic y’s there is a Klein 
factor, as in exercise [A2.3e; for the fermionic y’s the exponential is equal to 


the usual product.) 


b In twice-odd dimensions, consider the direct product of two spinors by rep- 
resenting the OSp spin operators as a sum in terms of the two different sets 
of OSp y matrices acting on the two different spinor indices. Define the U(1) 
(O(2)) symmetry that mixes the two y matrices by taking linear complex 
combinations of the y matrices to form fermionic creation and annihilation 
operators, so the OSp-invariant U(1) generator is a'4a4. Show that the eigen- 
values of this generator pick out the different Lorentz representations. These 
can be made irreducible by including y_; projections. Using explicit U(1) 
and (both) y_1 projectors in the action, show that self-dual tensors can be 
described. (Note: This description contains an infinite number of auxiliary 
fields.) 


5. Examples 


The OSp(1,1]2) method is thus an efficient method for finding gauge-invariant 
actions (though not so useful for gauge fixing). We begin with examples of massless 


bosons, for which the gauge-invariant kinetic operator is 
Kg = 3(-O+ 59°9,), Q* = "0, 


The scalar is a trivial example; the simplest nontrivial example is the massless vec- 
tor: In terms of the basis |') for an OSp(D—1,1|2) vector (D-vector plus 2 ghosts), 
normalized to 


(ant = Can (i) = 08 


830 XII. MECHANICS 


we can write the OSp(D—1,1|2) generators as 
Say = "| 
The Sp(2)-singlet field is then (dropping the |“) term) 
@ = |")Aa(*) 
We then have 
Q* = (|) 1— |) CG. = QD? = 5 O° Q,. = |*) ("OH — 31°) (ol 
= Legit) = §3(Fav)” 
and for the gauge invariance 


5b = 5940°Ag => Aa=la)A(z) => 6Ag=dQA 


A more complicated example is the graviton (massless spin 2): We write the field, 
a graded symmetric, traceless OSp(D—1,1|2) tensor, in terms of the direct product 
of two vectors, with basis |’)|’). The spin operators are thus 
S¥= St @ly + ly @Sh 
where the first factor in each term acts on the first factor in |')|’), etc.; Ij) is the 
spin-1 identity. The s = 0 part of the field is then 


P= |)P Rg, R= hatha =0 = b= (\%)) +31 )la)N™) hap 


where hy, includes its trace. The rest is straightforward algebra; we use identities 
such as: 
P= Oi) ® La) + Ia) ® Qi) as OQ") ® Qiayja 


Qi ® Daj = (Ila) 7101 + [YPC al) a 
(C1 (al) (I?) Lad) = — (1?) (ala) = -2 


where Qi) was evaluated above, and in the last identity we used the fact that |®) is 


anticommuting. The final result is then 
Lgi2) = — Fh Ohas + 2(0hap)? — h*,10h®, + 2h2,0°O*hpg| 


in agreement with subsection IXB1. The original OSp(1,1|2) gauge invariance reduces 
to 
Ma = (la) I") + |*)la))Aa = hab = Oars) 
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Exercise XITA5.1 
Consider a (D—2)-rank antisymmetric tensor (i.e., totally antisymmetric in 
D-—2 indices in D dimensions; see exercises ITB2.1 and VIIIA7.2, and subsec- 
tion XA3). 


a Show from a lightcone analysis that it is equivalent to a scalar. Derive the 
gauge-invariant action using OSp methods. Find the gauge transformations 
and field strength. 


b Find a first-order form for the action, (auxiliary field)? + (auxiliary field) x 
(field strength). Show that eliminating the gauge field as a Lagrange multipler 
results in the action for a scalar. Show that switching between scalar and 
antisymmetric tensor is equivalent to switching field equation and constraint 
for the field strength. 


c Find the description for the massive case by dimensional reduction. 


Exercise XITA5.2 
Consider a tensor totally symmetric in its vector indices. In the lightcone 
gauge, the irreducible tensor is traceless. Show that, upon covariantiza- 
tion, the field appearing in the gauge-invariant action satisfies a double- 
tracelessness condition (or equivalently the fields appearing there are the 
totally traceless tensor and another totally tracelsss tensor with two less in- 
dices). 
For massless fermions we saw 
Kgs = "S20. 
The next step is to use the fact that arbitrary fermionic representations are con- 
structed by taking the y-traceless piece of the direct product of a (Dirac) spinor with 
an irreducible bosonic representation. (Just as an irreducible bosonic representation 
of an orthogonal group is found by taking the direct product of vectors, choosing an 
appropriate symmetry, as described by the Young tableau, and requiring the trace 
in any two vector indices to vanish; here we also require that using a y matrix to 
contract the spinor index with any vector index also vanishes. Of course, simpler 
methods can be used for SO(3,1), but we need methods that apply to all dimensions, 


so they can be applied to orthosymplectic groups.) We then can write 

G8 = F9 Lyi yi} a G08 = $28 _ gilag®) 
where SJ is the part of the spin acting on just the vector indices, and we have 
combined 7° and ¥° into creation and annihilation operators, as in subsection VIA3: 


ae -_ aay" + 14%), qi@ — al" _ iy); [aq, at? _ —6? 
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For spin 1/2 S = 0, 6.0 projects to the ground-state of the oscillators, and we 
immediately find 
Oa = —Ya'y" > Lgi(t/2) = 5OY 1000 
where we have used 


Ay = i¢(alag — a%al,) = i(al?ag — 1) = -i(N + 1) 
and this “N” counts the a! excitation level. (Note that the Hilbert-space inner 
product between the spinors includes the usual factor of yo.) A less trivial case is 
spin 3/2: Now 

G=|bi, 9° = —y%y* + |!) (4 
where ¢; has an explicit vector index, and an implicit spinor index. Then from y- 


tracelessness (for irreducibility) we have for the Sp(2) singlets 
bi =0 = ¢ba=—Va¥"ba > Aaba=9, a= at a"ba 


After a little algebra, using identities such as 


Eyer] = yer + F (Pry) — ¢y) 


we find 
Le gi(3/2) = — dba yisbc 
From inspection, or from 6¢ = 6,95 Q° Aq, we find the gauge invariance 


A 


dda = Oar 


Exercise XITA5.3 


Let’s now examine some massive examples: 


a Find the gauge-invariant actions for massive spin 2 and spin 3/2 by dimen- 


sional reduction of the massless cases. 


b Note that for the spin-2 case the part of the mass term quadratic in h is 
proportional to —hjq“hy°. More generally, we might have expected (hay)? + 
k(hq*)? for arbitrary constant k, since the first part gives mass to the physical 
(transverse, traceless) part of h, while the second term affects only the unphys- 
ical pieces. Find the Stiickelberg terms generated from this generalized mass 
term by the linearized gauge invariance. Looking at just the terms quadratic 
in the Stiickelberg vector, what is special about k = —1, and why do other 


values of k give ghosts? (Hint: Compare gauge-fixed electromagnetism.) 
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S seuiarateeeaea eae B. IGL(1) 


Although the OSp(1,1|2) method is the simplest way to derive general free gauge- 
invariant actions, it does not yield a simple method for gauge fixing, even though 
the Hilbert space contains exactly the right set of ghosts. We now describe a related 
method that is slightly less useful for finding gauge-invariant actions (it includes 


redundant auxiliary fields), but allows gauges to be fixed easily. 


1. Algebra 


For this method we use a subset of the OSp(1,1|2) constraints, and show they are 
sufficient. A simple analog is SU(2): To find SU(2) singlets, it’s sufficient to look for 
states that are killed by both 73 and the raising operator T; + i775. This approach 
gives a formalism that turns out to be easier to generalize to interacting theories, as 
well as allowing a simple gauge-fixing procedure. We first divide up the Sp(2) indices 
as a = (@,) (not to be confused with +). We then make a similarity transformation 


that simplifies some of the generators (while making others more complicated): 
J>uUJU': w= (arty 
which changes the Hilbert-space metric (and corresponding hermiticity conditions) to 


Y =UtU =(-1)"°°, (WE) = pure 


This simplifies (looking at the massless case without SA4® for simplicity) 


+ _ 4 7a , Je? + Jee 


mee | 
Je — JPA + iJPPO® = (-2 0*)0® + $2°D + SO, + 8° Ag 


(We use the same conventions for raising and lowering Sp(2) indices as for SU(2) in 
subsection I[A4 and SL(2,C) in subsection I1A5.) These four generators form the 
subgroup GL(1|1) of OSp(1,1/2) (=SL(1|2)): We can write the generators as J7/, 
where J = (+,@). In subsection XIIB4 we’ll see that the singlets of this GL(1|1) are 
the same as those of OSp(1,1|2). 


On the other hand, because of the simplified form of these generators, it’s easy to 
see how to reduce the group even further: Applying some of the constraints on wave 
functions/fields to the right, 


I? =e? = 0, JOP age ae 
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=> J =~iz®dy+i+S%, J = 4290+ 5999, + $®%0q 


(Of course, this further reduction could have been performed even without the trans- 
formation.) We are now left with the group IGL(1), with just J®°° and J®~ as gener- 
ators. (J®~ acts as translations for the GL(1) generator J®°.) Also, we have reduced 


the unphysical coordinates to just x®. We now simplify notation by relabeling 
c=", t=0; Pais, Jarre Gas 


se | PhS Q = $c + $970, + S°*b 


J and Q are versions of the ghost-number and BRST operators introduced in sub- 
section VIA1. The net result for obtaining these IGL(1) generators from the original 
OSp(1,1|2) generators can also be stated as 


J Sid" | p0=0,8¢<5; Q = J* |ge-0,9+=1 


where 0 = 0 can be regarded as a gauge condition for the constraint x° = 0, and 
Ot =1 for x~0* (~ 0/O(In OT)) = 0. The IGL(1) algebra is 


LQ=Q, Q=7{2,Q}=0 


Exercise XIIB1.1 
Show that any IGL(1) subgroup of OSp(1,1|2) (J = iJ®°, Q = J®~) satis- 
fies these commutation relations. Check that this final representation of the 
IGL(1) algebra satisfies them. 


Exercise XITB1.2 
Using the results of subsection XITA2, 


a Give expressions for Q and J in terms of (c, b,) K, Q°, and S8, 


b Derive {Q*%, Q°}, assuming only Q? = 0 and the previous results for [S, S] 
and [S, Q]. 


2. Inner product 


The new inner product can be derived by the same steps: Starting with the 
lightcone inner product of subsection XIIA1, we add extra dimensions to get the 
OSp(D,2|2) inner product. We next drop dependence on x, which will be eliminated 
in the IGL(1) formalism. Then we perform the transformation with (0+)’7°° = 
(O+)/—! used to simplify the BRST operator. This acts on both fields in the inner 


product; applying integration by parts turns one such factor into (-O*+)~7. The net 
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effect is that it cancels the 0* in the Hilbert-space metric, which allows us to drop 


the x~ integration, and it introduces a factor of (—1)/. 


Rather than defining a Hilbert-space inner product, which is sesquilinear, it is 
slightly more convenient to define a symplectic inner product, replacing the Hermitian 
conjugate of the wave function/state on the left with the transpose, in analogy to an 


ordinary vector inner product. The inner product is then 


= ~i(-1)t faa de W" (x, c)(—1)’ E(x, c) 


A Hilbert-space inner product can then be defined simply as (W*|=) (where (W*)? = 
W'). We have included a sign factor corresponding to what would be obtained if the 
dc integration were moved to the symmetric position between the two wave functions: 
By (—1)” we mean take W = 0 in the exponent if Y is bosonic and 1 if W is fermionic. 


We can make this manifest by defining 
Vigo Cay). = F= -i f de dc |x,c)(—1)7 (x, ¢| 


which allows the inner product to be evaluated between (W| and |=) by inserting this 
form for the “identity” J. 
Exercise XIIB2.1 
Work out the inner product for the vector field in terms of all of the compo- 
nents (both expanding over c and separating physical and ghost parts of the 
OSp(D—1,1|2) vector). 
As a result, any commuting or anticommuting constant factor “a” can be moved 


out of the inner product from the left or right in the usual way: 
(W|Sa)= (|E)a, — (a|E) = a5) 
As a consequence of the anticommutativity of the integration measure, we have 


(=)? = Sl ae 


meaning that the statistics of the inner product is the opposite of an ordinary product; 


oP 


we can think of “|” as a fermion. 


Because of the change in metric from the lightcone, the IGL(1) generators now 


satisfy 
JP == J, Qr a Q 
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where the constant comes from dropping the extra coordinates, and the transpose 


“T” indicates integration by parts (the usual transpose in the infinite-matrix repre- 


sentation of operators): 
[voz 7 [cyreonme 


Before our transformations the generators all satisfied G’ = —G; now they are pseu- 


doantisymmetric with respect to the metric (—1)/, up to the constant: 


(W\OZ)=(OW|S) => O=(-1)0O7(-1) 


J=1-J, Q=-Q 
From J’ = 1-— J also follows the symmetry property of the inner product: 
(WIE) = (-)e MED (EI) 


This can be interpreted as antisymmetry once the anticommutativity of the “|” 


(metric) is taken into account. 


The hermiticity conditions that follow from the change from the lightcone are 
Pela 0 


Before all generators were antihermitian; now they are pseudoantihermitian, up to a 
constant: 
O = (-1)0r(-1)” 
> J=1-J Q=-Q 


The factor of i in the inner product compensates for the funny hermiticity of (—1)/. 


We then find the usual hermiticity condition for a vector inner product, 
(W|E)* = ("0") 


3. Action 


As explained in subsection VIA1, we are interested in states in the cohomology 


of the BRST operator Q, which means states satisfying 
Go = 0, df =iQA 


In particular, the physical states are states in the cohomology of Q at ghost number 


J = 0. However, now Q@ = 0 is the wave equation (as in subsection VIA3), and QA 
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contains the usual gauge transformations (as in the OSp(1,1|2) action of the previous 


section). 


The free gauge-invariant action for an arbitrary field theory is then 
So = (1 f ae de 5P' 5 QP = —(-0 f ae dc £1 Q65 oP 


for a real column-vector field 6. (Complex fields can be decomposed into their real 
and imaginary parts. For relating to quantum mechanics, we will usually consider 
the column-vector in our OSp Hilbert-space notation.) This action gives Q® = 0 as 
the equations of motion, and has 6 = QA as a gauge invariance, so the solutions are 
the cohomology of Q. The projector 6jo9 is a Kronecker 6 restricting ® to vanishing 
ghost-number (and thus Q@ to ghost number 1, since [J,Q] = Q). As we'll see 
in section XIIC, this projector is redundant: The states in the cohomology with 
nonvanishing momentum automatically have vanishing ghost-number, and the states 
with nonvanishing ghost-number are needed for gauge fixing. The projection is useful 
only for eliminating states that are redundant for discussing gauge invariance; we’ll 


drop it for the remainder of this section. The “complete” free action is then 


So = -- f ae de 467 (-1)"'Q@ = $(®|iQG) = Sy! 


where we have included the inner-product metric. This is just the translation of 
the free BRST operator from first- to second-quantized form, as for the lightcone in 
subsection XITA1. 


We now consider some simple examples, to see how this method reproduces the 


usual results. The simplest example is the scalar: As shown in subsection VIA3, 
Q=cx(O-m’?) = - f ac 50(—-1)" 'Q¢ = —363(O— m’)¢ 


without restrictions to vanishing ghost number, unitary transformations, gauge fixing, 
etc. Thus the scalar is in no way a gauge field: The kinetic operator follows from 


simple kinematic considerations. 


The fundamental example of a gauge theory is a vector: It is the defining repre- 
sentation of the Lorentz group, and of the extended Lorentz group we used to define 
the BRST operator. In the rest of this chapter we will see from its action most of the 
general properties of gauge theories: ghosts, gauge invariance, BRST transformations 
of the fields, the gauge-invariant action, gauge fixing, backgrounds, mass, etc. In 


addition to the equations given for this case in subsection XIIA5, we will use 


(@[9) =-@/®) = 
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The BRST and ghost-number operators are 
Q = 5D + ([) A] = [*)P[)Oa + 2I%) (|B, F = eb + a([®)(P] + |°) 1) 


The field is real; it can’t be called hermitian, since it is a column vector, but each 
component of that vector is hermitian. (This is the same as reality, but for anti- 
commuting objects reality includes extra signs that are defined to be exactly those 


coming from hermitian conjugation.) Thus, 


S 
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BT = Bi = (A,("| + 6C(°| — 1C(P)) + (Aa(*| - CP] — CPi 


oY 99 


where we denote the “antifields” (those at order c) by a 


The BRST transformations of the fields can be found by comparing terms in @ 
and Q@: If we define a second-quantized BRST operator Q such that Q& = Q@, but 
Q acts only on the fields while Q (as usual) acts on |*) and the coordinates (x and c), 
then 


B=|'\(bi— ich) > QG=QH=|')[(-1)'Q¢; — ic(-1)*"Qui] 


In other words, we compare terms in ® and Q@, and throw in a minus sign for 
transformations of fermions. (So, e.g., “QA,” is the coefficient of |") in Q@.) Dropping 
the “*” on Q, the result is 


QA = mee QC = 0, QC _ ~2i(C —_ $0 i A) 
QA, =-i(404,-0.0), QC =400+4+0-A, QC = -tOC 


Note that although Q is hermitian, it is antihermitian with respect to the inner- 
product metric (—1)/, as expected from our convention of using antihermitian gener- 
ators for spacetime symmetries. (The same extra sign for hermiticity vs. pseudoher- 
miticity, also because of ghosts introduced by relativistic quantum mechanics, occurs 
for the spatial Dirac matrices y': see subsection XC2.) As a result, our transforma- 
tions agree with those of subsection VIA4. However, while the first-quantized Abelian 
transformations also agree with those of subsections VIA1-3, the second-quantized 
nonabelian transformations will have the extra i demonstrated in subsection VIA4, 
following from the 7 introduced in the inner-product metric in the previous subsec- 
tion. (This minor yet annoying factor will be further discussed in section XIIC when 


we relate first- and second-quantized BRST.) 
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The Lagrangian then can be expanded as (after some integration by parts) 
iS / de 167 (-1)71Q6 = -i1(A-QA-CQC -EQC+ A-QA-CQC -CQC) 


=1(F,)? + (6 -—40-A?? -1C1000 + 14-80 
where we have used the transpose of the field on the left of Q@. To find the gauge- 
invariant action, we can evaluate it by keeping just the (anti)fields with vanishing 


ghost number (A, and C), and then eliminate the remaining antifields by their equa- 


tions of motion: 


Lo 2i(Fy)?+(C-40-AP > Le =4(Fu)? 


NIH 


Exercise XIIB3.1 
Consider the example of the second-rank antisymmetric tensor (see exercises 
I1B2.1, VIIIA7.2, and XIJA5.1, and subsection XA3): 


a Construct the states by direct product of two vectors. Decompose into fields 
plus antifields, physical plus ghost: In particular, note the Sp(2) representa- 
tion of each SO(D—1,1) representation. 


b Find the BRST transformations for all the (anti)fields. In particular, note 
that the tensor transforms into vector ghosts (as expected from the gauge in- 


variance), which themselves transform into scalar ghosts (“ghosts for ghosts” ). 


c Graph all the states for s (of the Sp(2) S$’) vs. J, and indicate there how 
BRST relates them. 


d Find the gauge-invariant action from Q@. 


e Generalize to arbitrary-rank antisymmetric tensors. Compare the results of 
exercise XITA5.1a. 
4. Solution 


The identity of the cohomology and the physical states can be proven most easily 


by making a unitary (“gauge”) transformation to the “lightcone gauge”: 
OQ, QT Taeherrenie 
which simplifies Q while leaving J unchanged: 


Q- 5cD - $*-0*, J —cb4+ 8° 
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These are the usual lightcone indices + of any D-vector, not to be confused with the 
+ used earlier when reducing from D+2 bosonic dimensions. Except for the extension 
to include the © index, this is the same transformation used in subsection IIB3 (and 
XITA1). 


This makes the generators separable, allowing us to treat the two terms in Q and 


J independently. Specifically, if we integrate the action over c, 
= ¢-icp = - f ac 167 (-1)7-1Q6 = —147(-1)° Od—iy? (-1)-18°- 0+ ob 


Then w is just a Lagrange multiplier enforcing the algebraic constraint S®-¢ = 0 
(ignoring 0*, which we always assume is invertible in the lightcone approach), leaving 
just the Klein-Gordon term for the part of ¢@ that satisfies the constraint. We also 


have the gauge invariance 
A=A+cy, 6@=iQA => 66= -iS° Ot, 6b =—-S* Oty — 501A 


so we can shoose the gauge where ¢ is restricted (algebraically) to be in the cohomol- 
oeyor Ss?” ; 

To solve for the cohomology of S®~ it is sufficient to consider the reducible repre- 
sentations formed by direct products of vectors (for bosons), or the direct products of 
these with a single Dirac spinor (for fermions), since by definition the OSp(D—1,1]2) 
generators S$” don’t mix different irreducible OSp(D—1,1|2) representations. We’ll 
show that this cohomology restricts any reducible OSp(D—1,1|2) representation to 
the corresponding reducible SO(D—2) lightcone representation, and therefore restricts 
any irreducible OSp(D—1,1|2) representation to the irreducible SO(D—2) represen- 
tation from which it was derived. (Also, the irreducible representations in arbitrary 
dimensions are most conveniently found by such a construction, where reduction is 
performed by symmetrization and antisymmetrization and subtracting traces of vec- 
tor indices, and in the fermionic case also subtracting gamma-matrix traces and using 
Majorana/Weyl] projection.) 

For bosons, we first consider the representation from which all the rest are con- 
structed, the vector. Writing the basis for the vector states as |’), where S“/|*) = 
|)", we find 

So =0. = noel) |") 


(= 5°" = mors 


This leaves only the transverse lightcone states, as advertised. For the direct product 
of an arbitrary number of vectors, we find the same result: The unphysical directions 


are eliminated from each vector in the product. 
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We might worry that extra states in the cohomology would arise from a can- 
cellation of two terms, resulting from the action of the “@” and “—” parts of S®~. 
Specifically, this could happen if we could separate out the supertraceless part of 
(the graded symmetric part of) the product of two OSp(1,1|2) vectors. (For exam- 
ple, for an SO(n) vector we can separate the traceless part of a symmetric tensor as 
= +i; Tkk-) However, this is not possible, since for OSp(1,1|2) 


str(d4) =2-—2=0 
Explicitly, we can look at the two likely candidates for extra states in the cohomology, 
17) = a®) 1°) 
(and their transposes). But using 
se |) =-19), $9) =-4F) 


for these states we find 


SEIT) + a) 1°) = -21°)17) 
a a = 


so neither state is in the cohomology. Note that we take S®~ to anticommute with 


|“); the states in the Hilbert space are assigned statistics. (This is the simplest way 


to allow a direct relation between wave functions and fields.) 


Exercise XIIB4.1 
Check this analysis for spin 3. 


Note that in the Lagrangian —4¢701¢ — iv S®-O*¢@ the fields in ¢ that are 
nonzero when acted upon by S®~ are auxiliary, killed by the Lagrange multiplier 7. 
On the other hand, the fields that are S®~ on something are pure gauge, and do not 
appear in the wS®~@ term because S®~ is nilpotent, while they drop out of the dé 
term because the fields multiplying them there are exactly the auxiliary ones that 
were killed by varying w. This follows from the fact that a field that is pure gauge 
with respect to S®~ has a nonvanishing inner product only with an auxiliary field, 
since dy = S°~\ > dod = G28" X. Equivalently, a field redefinition yy - w+ ANd 


can cancel any terms in ¢D¢ where one ¢ is S®~ on something. 


For the example of the vector, we have explicitly for the transformation to the 
lightcone 


ns = ly = HODa + HC] = PD 8] 
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under which the Lagrangian becomes 
Lol =-14-0A-i6400+ Cota -id-atc 


The lightcone gauge transformations are 


6At+=-Otr, 6C=-4D) 
6C=-ot¢, 6A =-10¢ 
6C= 50°, dAt=3ne, 6A’ = 50e 


a0 
a 


where here refers to the transverse (D—2) components. 


5. Spinors 


In general we can add nonminimal terms S$“? of subsection XILA2: The easiest 
way is to add them as the last step, remembering that Q comes from J®~ and J from 
J°: this yields 


Q = $c + $970, + 89° 45%, Jct SF+ 8° 
2 


This result can also be seen from first-quantization of spin 1/2 (subsection VIA3). 
Alternatively, if we add S$ at the beginning as in subsection XIIA2, performing the 


transformations given there, followed by the transformation 
r= ( oye. 


of subsection XIIB1, where J®® itself now contains S' terms, we again find 


Lo se Se, Jot = J7*+ ~iJj® 


Be OF as J®- + J* of +458 fal 
Adding a final transformation 
vee 
(which actually undoes part of an earlier one), we again obtain the above result. 


As in the previous subsection, we can also transform to the lightcone gauge, to 
find 
Q— ic — ®t + S® 


When analyzing the BRST cohomology in the lightcone gauge, the effect of this 


nonminimal term is to replace (again ignoring the factor of —0*) 


gAB = GAB _ GAB oe GAB 
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(although actually only the S®~ and S? parts are used here). 


As in subsection XITA4, when treating fermions we choose the Dirac spinor rep- 
resentation of OSp(1,1|2) for $48. Thus, for the case of spin 1/2, where $4? also 
is a Dirac spinor representation, SA® is represented by the direct product of two 
OSp(1,1|2) spinors. We now use the harmonic oscillator interpretation of the ghost 
coordinates used in subsection XIIA5 (extending it trivially to the fermionic ones), 


which can be applied to arbitrary OSp groups: 


GAB = AA}, G48 = 2154, 4} 


aay +i),  al4= Aly? 197") 
= fat at? = =i, GAB _ _gtl4g®) 


By expanding about the oscillator vacuum, we find this representation of OSp(1,1|2) 
consists of the direct sum of totally (graded) antisymmetrized tensors: |0),|4) = 
at4|Q), ||45)) = atAat®|0),..... But we have already treated this case for the bosons, 
the result being that only the singlet (vacuum) survives. Of course, the complete spin 
representation is given by the direct product of the representation of the unphysical 
variables (y4, 74) and the physical ones, namely the transverse lightcone gamma 
matrices. Thus, the states in the cohomology are given by the direct product of all 
the lightcone states with the vacuum of the unphysical variables. To treat arbitrary 
fermions, generalization to direct products of the Dirac spinor with arbitrary numbers 
of vectors works the same way, since the spinor looks like the direct sum of parts of 
GAB 


direct products of vectors as far as is concerned. 


Exercise XIIB5.1 
Work out the explicit Q and Hilbert space for spin 3/2. 


6. Masses 


As usual masses can be added by dimensional reduction: Our complete result for 


application to massless and massive, bosons and fermions is then 


Q = 4e(O — m?) + S®20, + S® him + S®®b  (+5°°), J=chb+S> (+53) 


with extra 2’s introduced implicitly by the procedure given in subsection IIB4. 


For example, for the vector we have (the “Stiickelberg formalism” ) 


Q = 53e( — m*) + (19) A] = 1) Oa + (PAT) Alm + 21%) (2 [b 


J = ch + i(l?)(P| + [P) PI) 
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Compared to the massless case treated in subsections XIIB3-4, the corresponding 


field now has the extra terms 
> 0+ |"')b—iel")b 


giving the action 


J = ch —iy’y? — FP 
where in the notation of subsection VIA3, 
P=& Pail, P=& Pail, F=-w 


Exercise XIIB6.1 
Use this method to work out the action and gauge transformations for massive 


spin 2. 


7. Background fields 


The coupling of external fields can be treated by suitable modification of the 
BRST operator. In terms of self-interacting field theories, this corresponds to writing 
the field as the sum of quantum and background fields, and keeping in the action only 
the terms quadratic in the quantum fields, as discussed for semiclassical expansions 
in subsection VA2 and for the background field method in subsection VIB8. 


One interesting case is the coupling of an external vector gauge field. Clearly the 
spacetime derivatives in @ must be modified by the minimal coupling prescription 
0 — V = 0+7A, but dimensional analysis and Lorentz covariance also allow the 
addition of a nonminimal term proportional to F®S,, to O. With the appropriate 


coefficient, the general result is (see subsection VIIIA3) 
Q7 = 3¢C(D — iF Sj.) + S°°V, + SP%b 


where O is now the covariant V?. (J is unchanged.) 
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In the case of spin 0, this modification is trivial. For spin 1/2, we substitute 
the graded generalization of the Dirac matrices, SY = —$[y',7}, as discussed in 
subsection XIIA5. We then find Q7 = 0 fixes the above coefficient of the nonminimal 
term, the same as from squaring y°V,. This follows from the simple factorization of 
SY in Qr: 


Qr = —e(y°Va)” — Y°(9°Va) — (79) = —(7°Va + Yb )e(7"Vs +70) 
where we have neglected the S®- term of subsection XIIB5, and used 
l*,e = {7%, c} = 0 


In the spin-1 case, we find the interesting result that Q? = 0 requires not only the 
above coefficient for the nonminimal term (as expected from supersymmetry), but 
also that the background terms satisfy the field equation V,F® = 0. On the other 
hand, for spins >1, Q?7 = 0 implies F,, = 0, so these spins can’t couple minimally (at 


least in flat spaces). Similar remarks apply to coupling gravity (spin 2) to spins >2. 
Exercise XIIB7.1 


Check these statements for spin 1. Compare the analogous result for back- 
ground fields in the field theoretic approach from exercise VIB8.2 for Yang- 
Mills for both the gauge transformations of the gauge-invariant action and 


the field-theoretic BRST transformations of the gauge-fixed action. 
Exercise XIITB7.2 


Show that electromagnetism can’t couple minimally to (massless) higher spins 


(i.e., they can’t have charge): 


a Show this for the graviton by considering Q? = 0 for spin 2 (symmetric 


traceless OSp tensor) in an external vector field. 
b Do the same for spin 3/2. 


Another interesting feature of the spin-1 case is that we can define a “vacuum” 
state 
0) = |°) 


which is in the free BRST cohomology of Q only at zero momentum (constant field), 
where Q simplifies to S®®b without background. However, this state has ghost num- 
ber J = —1. In fact, it corresponds to the global part of the gauge invariance of 
the theory: Gauge parameters satisfying QA = 0 have no effect in the free theory 
(where 6 = iQA), but can act in the interacting theory: They do not contribute 


an inhomogeneous term to gauge transformations. However, gauge parameters of the 


B. IGL(1) 847 


form A+ Q<£ have the same effect as A, up to trivial transformations proportional to 
the field equations. Thus, while the BRST cohomology at J = 0 gives the physical 


states, that at J = —1 gives the global invariances associated with the gauge field. 


Now the physical states can be derived by operating on the vacuum with appro- 


priate vertex operators: If we expand Q; about the free BRST operator Q, 


Qr=Q+V 
oO; =0 => {Q,V}+V’?=0 
6Q1 = [Qr, d] => 6V =1[(Q4+V,A] 


where A is the gauge parameter for the background field. The usual operator coho- 


mology, relevant for asymptotic states, follows from linearization: 
VoV => {Q,Vo}=0, bY =7[Q, I 


in the weak-coupling limit, where Vo is the part of V linear in the background fields. 


The asymptotic states in the cohomology of Q are then given by 
@=V|0), A=A\0) = QG=0, 12 =i10A 
We can check this explicitly, as 
V = ike({Ag, O°} + 1A? — F?Spq) +iAgS®* = Vol0) = Al") + icd(O*A,)|0) 


The second term gives C = 30 - A, in agreement with the free field equations. 


8. Strings 


Another interesting example is strings. Since first-quantization is essential in 
string S-matrix calculations, it’s natural to associate string field theory with quantum 
mechanical BRST. As usual, for massive fields this formalism automatically includes 
the Stickelberg fields that would have been found by dimensional reduction, as well 
as all the ghosts. However, the explicit expression for the BRST operator does not 
explicitly correspond to that obtained by dimensional reduction: Although the spin 
operators S®*, S®®, and S® are quadratic in oscillators, S®~! is cubic (because P~, 
and thus X~, is quadratic in the lightcone gauge). Nevertheless, the representation 
on any particular irreducible Poincaré representation contained among all the string 
states is the same as obtained by dimensional reduction, as follows from the generality 


of our analysis. 
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As for any Poincaré representation, reducible or not, all we need is the lightcone 
spin operators, given for the general case in subsection XIIA1. In subsection XITA2 
we saw that the OSp(1,1|2) generators followed immediately from just a change in 
notation. The IGL(1) generators were then found in subsection XIIB1 by a uni- 
tary transformation and solving half the constraints of GL(1|1); the net result was 


equivalent to applying “gauge conditions” to the original OSp(1,1|2) generators: 


J = iJ°°|90-0,0+=1; Q = J |ge=0,9+=1 


For the string, the gauge condition 0* = 1 simply removes the last vestige of XT, 
whose oscillator modes were already eliminated by the string lightcone gauge. On the 
other hand, the condition 0° = 0 makes X° = X as 


covariant objects: With the elimination of the linear 7 term in the expansion of X®, 


+X ie the sum of two conformally 


x is) are periodic in their arguments, and have the usual mode expansion in terms of 


exponentials only (no linear term). 

As a result, the decomposition of Q as obtained from the lightcone becomes 
trivial (J was easy anyway, since it’s quadratic): Relabeling the result for the string’s 
lightcone Lorentz generators from subsection XIB1, 


yoni f (Ox =) 


27a! 


Fey) 


(where now the “i” comes from using the antihermitian form of the Lorentz spin), 


separating X® into its (+) pieces, using their “chirality” X oes ( 


‘) to convert 


the 7 derivative into a o derivative, integrating by parts, and applying the definition 


Peay = yg (X + X’) 


2 | de RB 
_; e p- 
a=if= [POX Pe) 
In this form we can easily apply the Virasoro constraint, as solved in the lightcone, 


1 a 
P’ 2 
2KYV Tai! ) 


to X~, we obtain 


Pp? = 0, Pt=kv 2Qa'pt > P- =i 
where we have applied 0* = 1. Finally, we relabel 


xe 


a) = Ce) 


for purposes of identification with the usual BRST procedure in terms of ghosts C’ 
and antighosts B (see subsections XIB2 and XIB7); comparison of the (equal-time) 


commutation relations then gives the further identification 


(Pi) (1), Pi, (2)} = Fin'¥2m8'(2 - 1) 
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= [Piy(1), X4,(2)} = —ty/n¥2n6(2- 1) 


The final result is then 


1 da 5 
Q=230 f x Cw (-Hy #iClayBie) 


in agreement with direct first-quantization of this string in the conformal gauge, in 


terms of the constraints gare The ghosts can be separated into zero- and nonzero- 


1 jal v® / 
Cis) — Ze 7 TY Bis) = 2Kb + ayy) 
(For the closed string there is also an extra zero-mode in C and B, enforcing the 


constraint that the “+” contributions to M? equal the “—”.) 


Exercise XIIB8.1 


By separating zero-modes in the string’s @, and comparing with its generic 


modes as 


expression 


Q = 3ce(O — M”) + S®90, + iS 1M + Sb 


show that 


Of particular interest is the massless level: As mentioned in subsection XIA4, 
using the fact that the Hilbert space of the closed string is the direct product of the 
Hilbert spaces of open strings gives a simple analysis of the massless states of any 
closed string, since the massless states of any open string are given by a vector (multi- 
plet) plus perhaps some scalars in the nonsupersymmetric case. To find the complete 
off-shell structure, including auxiliary fields and ghosts, we can either take the direct 
product of the two lightcone representations and then add 2+2 dimensions, or first 
add the 2+2 dimensions and then take the direct product of the two OSp(D—1,1]2) 
representations. In the supersymmetric case the latter is more convenient, since the 
procedure of adding dimensions to superspace is not yet understood, but quantization 
of the vector multiplet is (at least for N=1, and probably for N=2, in D=4). 
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For example, for the bosonic closed string we just multiply two OSp vectors, 
producing a tensor ¢;;, which we can decompose into its symmetric traceless part 
hi; (graviton plus ghosts), antisymmetric part B;; (axion plus ghosts), and trace x 
(physical scalar): 


);)@|) 2 hg =tay— Sonit’n, Bij = tay), vet: 


(See subsection XIIA5.) However, we know that string theory prefers to treat fields 
in the string gauge for Weyl invariance, where the action (kinetic term) is not diag- 
onalized in the fields (until coordinate invariance is fixed also). This is understood 
from this direct product structure: The “natural” string fields are 

tab) graviton 

string gauge : tab) = axion 

t*, dilaton (T-duality invariant) 
since duality affects only the X modes, while the diagonal fields (representations of 
OSp(D—1,1]2)) are 


hab = Lia) = a Cae + ix) graviton 
normal gauge : Bab = tat] axion 


YH +e physical scalar 


Exercise XIITB8.2 
Show that the above OSp analysis is consistent with the diagonalizing field 
redefinitions of the low-energy string action found in exercise XIA6.2. Explain 
the result in terms of the redefinitions 
1 
p = er? ar i = J—og™ 
Vg 
In the heterotic case, as mentioned in subsection XIA4, we take the product of 
the real prepotential plus two chiral ghosts of super Yang-Mills with the usual vector 


plus two scalar ghosts of bosonic Yang-Mills: 
(V QD Pa) ® (Ag D Ca) =H, ® (Vo ® Pac) @ o) SP) P(a8) 


The result is a vector prepotential H, describing the physical supergravity and tensor 
multiplets (in a string gauge, G = 1), a chiral scalar compensator (“superdilaton”) ¢ 
appropriate for “old minimal” supergravity, first-generation ghosts (Sp(2) doublets) 
V, and ¢g., and second-generation ghosts (an Sp(2) triplet, for the tensor multiplet) 
$8). If the vector is accompanied by scalars, the closed string also has additional 
vector multiplets: 

(V @ ga) @ 91 = Vi ® bra 
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We saw in subsection XIIB7 that for Yang-Mills it’s natural to think of the (con- 
stant) ghost as the “vacuum” state. In open string theory, this happens automatically 
in conformal field theory. Thus, we again have a direct relation between the operator 
BRST cohomology and state BRST cohomology, creating states from the BRST- 
invariant vacuum with operators V = CW, where W has weight 1 and is constructed 
from X (and not C or B). 


We can also relate more general V’s and W’s by 
Q,fwW}=0 = [Q,W}=0V => [Q,V}=0 


which defines a V given a W. Thus ¢ V is the only part of V in the cohomology, 
the rest being a BRST variation: Moving V to a different value of z is a gauge 


transformation. Thus V(z) has explicit z dependence, while Q does not: 
Q1=Q+V(z) 
corresponding to a vertex local in z (as found in subsection XIB7). Conversely, 
O,V}=0, W=([fB,V} => [Q,W}=[{Q, $B}, V} =[$T,V} =—-OV 


as long as V has w = 0, and generally under some weaker restrictions. Thus this 


choice of gauge transformation 
A~W => 6V=([Q,A}~ OV 


allows translation of V(z) to arbitrary z. We can see the relation of these 2 operators 


from the external field approach: 
Q-Q4+V => §T->§$T4+W 


so V is a background term for Q, while W is a background for the “gauge-fixed 
Hamiltonian” ¢ T. 


The gauge can be generalized: 
Hy = H+W(z) = {Q7, A} = {Q, A} + {V, A} 


where the gauge-fixing operator is 
A= [(B+f) A? =0 


the first term in A giving the usual $(p?+ M”) in H, while f is left arbitrary (as long 
as A? = 0 is preserved). The inverse relation for W — V is unchanged as long as V 


transforms the same with respect to the new T. 


852 XII. MECHANICS 


9. Relation to OSp(1,1/2) 


For comparison to OSp(1,1|2), we perform a unitarity (gauge) transformation on 


the IGL(1) action. (The most general OSp(1,1|2) expressions are given at the end of 


subsection XIIA2.) We first define an almost-inverse of S®®: Since S®® annihilates 


states with s* = s, where we define 


—~15%S.6=48(s+1), _$°|s) = 283|s) 


(so that s and s? take their usual integer or half-integer values), we can define §ee-1 


such that 


We then apply the transformation 


Q mae Q diag — UQU* 


Q=-cK + Q°+S%b, = InU =c{{S®*"!, Q®} — 


God 1 GSS C® G4 


The exponent of U is nilpotent from the c, so it generates only a linear term, U = 


1+/n U. Using the commutation relations from subsection XIITA2 


[Se®, Q*| = 0, (Q®)? = kK gee 


we find 


Quiag = €5s3,6(—K + Q°S9°-1Q®)5,3_. + (chO®5,5,, + bedys,_,Q®) + S®b 


Now we apply the identity 


A 


[S°,AJ=0 =>  6,3,,46,3,5 = Adso 


3 3 ! 


since any matrix element between (s, s°|...|s’, 8°) gives s = s? = s’ = —s' 3s = 


s’ = 0, as well as the facts 


SPF 9 Gog = [S®®, Q?]5.0 = 2Q%5q9 => $$ PP-1Q5.y = —U QS 


{Q*, 215.0 = 
This yields the final result 


Qaiag = —C(K + 10°Q,)5e0 + (CO%5,2,, + 605.3, 2%) + S°% 
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Exercise XIIB9.1 
Use the commutation relations of S°%, as well as eo Be = 4s(s +1), to 
derive 


§9° $%® — d[s(s +1) — s*(s* — 1)] 


from which follows the explicit expression 


oe - G00 _ 1 ~ 058,5 gee 
gee gee A(s — s3)(s +s? + 1) 
Integrating over c in the action Saiag = — [ $27(—1)7"*QuiagP as in subsections 


XIIB3-4, we find the Lagrangian 
Laing = 397 (K + 42% Qa) 5.00 + Hb" (—1) 855, ,0%b — $7 (—1)* SPP) 


(Note that in a product of the form 77x a state of eigenvalue s? multiplies one of 
eigenvalue —s*, since ($?)t = (93)? = —S3.) We now sce that in this action only the 
s = 0 (physical) part of ¢ appears in the #¢ term, while only the s? = —s (“minimal” ) 
part (including physical) appears in the ¢y term. The only part of w that appears in 
the @w term is the s? = s (minimal) part of w (the “antifields” to the corresponding 
ones in @), while all, but only, the remaining (“nonminimal”) part of w appears in 
the ww term. In particular, the ¢¢@ term is recognized as the OSp(1,1|2) action of 
subsection XIIA3. The terms involving w can be eliminated by w’s gauge invariance 
and field equation, and w contains no propagating degrees of freedom (fields with 0 
equations of motion), as can be seen by the methods used to analyze the cohomology 
in subsection XIIB4. However, the auxiliary fields ~, and the ghosts in (the s 4 0 
part of) ¢ to which they couple, are useful in gauge fixing, as we’ll see in the next 


section. 


Again looking at the example of the vector: 


GSO — g|®) (2 | = 9eo-1 _ ioee = ae 
QO? = (I) - I). = In U = Age(|) 0] + |*)(P1) Oa 
The result of the transformation is then (cf. subsection XIIB3) 
Qaiag = 3¢(O — |*)("|8a9e)5s0 + cb|®) (*]Ba — be|*) (?|Ba + 2|°) (Pb 
Léiag = 4(Fap)? + 1A - OC + C? 
The BRST transformations now simplify to 


@A2-580 OGCet. OC2 iC 
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QA,=-i10R,, QC=0-A4, QG=0 


Exercise XIIB9.2 
Find the ghosts and simplified BRST transformations for massless spin 2. 
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Although the quantum mechanical BRST operator is clearly useful for gauge fix- 
ing, its relation to the second-quantized BRST we applied in chapter VI is not obvious, 
since the latter BRST operator does not include the gauge-invariant action. Here we 
relate the two, and extend the former to interacting field theories. In particular, we 
show how the $Q@ action leads directly to the gauge-fixed kinetic term as simply as 


it led to the gauge-invariant one, without applying any transformations. 


1. Antibracket 


In the usual Hamiltonian formalism we work in a phase space (q, p) on which is 
defined a Poisson bracket, useful for studying symmetry properties and equations of 
motion in the classical theory, and for relating to the commutator of the quantum 
theory (see subsections IA1-2). We want to interpret the present case of interest as 
an analogous phase space, for which the fields ¢@ (in © = @ — icw) correspond to q 
and the antifields ~ to p. This automatically follows from the lightcone commutator 
of subsection XIIA1, by the same steps used to derive the IGL(1) algebra and inner 
product 

(|=) = —i(—1)” fe de 8" (x, c)(—1) 5 (2,0) 


in subsections XIIB1-2: We thus define this generalization of the Poisson bracket (see 


ae 
ips 


where we have expanded the column vector @ over a basis in the usual way (see 
subsections IB1,5), 


subsection [A2) in terms of the inner product as 


— 


t) 


(f[P], 91) =fog, c=n”? car 


& = |'\@,, 6 — g!(, 
Capt a=ela, Gi Fan, 7 neSo; 


etc., and the indices J, J (not to be confused with the ghost-number operator J(c) 
appearing in the definition of the inner product above) run over all indices on the 
field, which determine the statistics of the corresponding component as (—1)/. (Thus 
® is always bosonic, the statistics of the fields coming always from expansion over |/).) 
This generalized commutator “(, )” is called an “antibracket” because of the unusual 
statistics associated with it, following from the same unusual statistics of the inner 


product. (The ordering of indices on 7/7 in the antibracket is the opposite of usual 
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to take into account the extra sign factor from the intervention of the anticommuting 
“|” between the two @’s.) Plugging in the definition of the inner product, we have 


more explicitly 


<— 


6 é6 
oS if av dc ia (Saree 

Note that while the inner product is defined between two functions of the coordinates, 
the antibracket is defined between two functionals of @, f and g, which don’t depend 
explicitly on (a,c) (although we can specialize to cases where they depend on other 
values of the coordinates (2’,c’)). Thus the orbital term cb in J(c) acts only on the 
argument c of /(x,c) (and not on g), while the spin term S? acts only on the index 
J of © @, ce); 

We have used the fact that 6/d@ is antihermitian, as follows from the fact that 
the graded commutator between it and @ is always a commutator, since they always 


have opposite statistics: 


6 Fo atae re) a + 
awe = (saree) 
— ep er e')] = §75(x' — x)5(c —c) 


n ON (6 8 
6b) \dbd) 86 


6 .\t Toe .)t 5 5 
(i) foe “lear 


Other properties of the bracket follow directly from those of the inner product: 


Thus, 


(—1)49 = (-1)ft9 


(f.ga)=(f.g)a, = (af,g) =a(f,9) 
Gg ==EDYrrnr Gs) 
(f,gh) = (f,g)h+ (-1)4*89(f, h) 
Deen Oa) aaa =0 
(f,9)' = (91, f") 


(for some commuting or anticommuting constant a). Thus, the bracket has the exact 
opposite symmetry as the inner product (as is the case with the usual brackets): 


It would be symmetric in its two arguments if not for the [ dc that sits effectively 
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between the two arguments. Most of the properties follow from this fact, and that the 
signs obtained from pushing things around are determined by moving things naively 


while treating the in the middle as anticommuting. Furthermore, the existence 


of a bracket with these properties allows the definition of a Lie derivative, 
L~AB = (A,B) 


Exercise XIIC1.1 
Find all the usual properties of this derivative (statistics, linearity, distribu- 
tivity, hermiticity, algebra, etc.), and relate to the usual Lie derivative. 
We also have functional identities such as 
) 


5B (ao) OM) = 181 (1) A(z, 0) 


(#2, 0), 1B) = -i(-1) ne Fa 


from which follow 
(@' (x, c), @7(a', d)) = —i(—1)7 76(e — 0)6(a' — x) = ~i(-1) "(ce +c')6(x — 2’) 
as well as 

(P(x, c), (P|W)) =W(z,c), (WS), (P|E)) = (WIE) 


Here W and © are wave functions in the same space as ®, but need not be taken as 
bosonic (or real): We can even take them as functionals of when applying the chain 
rule, using the above expressions for the terms where the 6/d@’s don’t act on them. 

Expressions quadratic in @ will be used to perform the second-quantized (or just 


classical field theoretic) version of linear first-quantized transformations: 


On = 5(D| AP) > (Oa, Oz) = OA,B}; A®@ = (®, Ox) 


df = - f ae dec (1lao! Sf => (O4,f)= fu dc (148) 


where A and B must satisfy 
(1a? = (199A 3 A=—(-1)OAT(-17O 


to give nontrivial contributions when appearing symmetrically between the two fac- 
tors of &. Corresponding group elements come from exponentiating bosonic first- 


quantized generators, yielding fermionic second-quantized generators: 


6f =(O4,f)=Lo,f > fl seat 
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(-1)*=1 53 (-1)9%=-1 = (Oa, f)=—-(f,0a) 


Clearly, the latter relations must hold when replacing O, with their nonlinear second- 


quantized generalizations. We then find (also for bosonic A) 
A® = (®,04) = —(Oa, ®) 


where the minus sign is the usual from translating first-quantized to second-quantized 
language (see subsection IC1). 


Expanding (@,@) in c as 
B! = o! — ten! (—1)*"d5 
we find 
(b1(2), 6” (2')) = —(¢" (2), br(a')) = 675(@ — 2’) 
This allows us to reexpress the antibracket as 
55 6 6 
== d —1 y = —_ + —_— 
fe) / ot ) 5, Ob! bo! 6d; 
For example, the antibrackets of the component fields for the vector are 
= |')G; = |*)[njA? —ée(-1) Al] = (9) Aa—a]°)C +4[*)C) —se(|*) A, — |°)C—|®)C) 
(Ai(z), A?(2')) = —(A? (a), Ai(e’)) = G6(@ — 2’) 
= (Aa, Ap) = Nab); (C, C) 0, ‘Gl C) =0 
where now “i” refers to the OSp(D—1,1|2) index (and we use C = A®, C = AS, 


C = Ao, C = Ao). 


2. ZIBV 


To prove gauge independence of the path integral, it’s useful to draw an anal- 
ogy of relativistic quantum mechanical BRST to second-quantized BRST. (We’ll see 
below that this is not just an analogy, but an equivalence.) Translating the BRST 
quantization of subsection VIA2 into path integral language, the general Lagrangian 


path integral for BRST quantization in quantum physics is 


A= | Da et" Ss’ = $+{Q, A} 
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where q is all coordinates, including ghosts. While S and A depend only on q, the 
BRST operator is linear in the conjugate momenta p — It generates a coordinate 
transformation: 


(Q,0d"} =-t6gg” => Q=(dgq™)Pm 


Here the index “m” includes all dependence of q, including time. (We saw a more 
explicit expression of this result in subsection VIA2, assuming the constraints G; are 


themselves linear in the physical momenta.) We also have 


[Q; S] = 0 


where S' can include not only the gauge-invariant action, but arbitrary additional 
gauge-invariant pieces. Such pieces can be used to construct states in the BRST 
cohomology from the vacuum. (This is the path-integral translation of the operator 
construction given in subsection VIA1.) It can also include pure BRST variations, 
{Q, Ao}. Thus, to prove gauge independence of A, we need only prove the vanishing 


of its variation under infinitesimal change of A, 
[rs e °{Q, 6A} =0 


But this is trivial, since Q acts as a total derivative, and [Q, 5] = 0. More generally, 
we require only 


Gas 


0=1-Q—4{Q,S] = -i(599") Im — (599) mS 


where 0,, = 0/Oq™, and “1- Q ” means the derivatives in Q act backwards onto the 
1 (and itself), as found by integration by parts. In cases we have considered (and 
almost always), 1- Q and [Q, S] separately vanish. More generally, since there is a 1/h 
multiplying S implicitly, nonvanishing values would require a “quantum correction” 


to S. We can also write this condition as 


er ost 
Furthermore, we can write ; 
= O 

em 

ODmOg™ 


These manipulations can be applied to the field theory expressions 7 for group 
generators, as found in subsection XIIA1 for the lightcone: The BRST operator as 


found from these generators is 


S=}(PiQ%), Q=0 © (S,S)=0 
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Then a unitary transformation can be implemented by exponentiating the infinitesi- 


mal transformation 
6Q=[G,Q] & JS =3(P|i(6Q)S)=(G,S), G=3(O|GS) 
J=eQe° & SH =F(oiQ'e) Ses 


We want to implement gauge fixing by performing a unitary transformation on 
Q and then evaluating S at the antifields wy = 0: 


A = [6 a at, oe = As => Sof = (e“4S)|y—0 
By similar manipulations to the BRST case, we see that gauge independence means 
= / Do e**(S, 5A) =i / Do (e7*8, 5A) 


where we evaluate this expression at y = 0, and we have again included arbitrary 
gauge-invariant pieces in S. We thus obtain gauge independence from (S,S) = 0, or 


more generally (again using integration by parts) 
6? og 
a= fu (-1/)————e* = pu (-1/ + is(S, 9) 
60169! 60100" 
This is the approach to BRST of Zinn-Justin, Batalin, and Vilkovisky (ZJBV). 
Exercise XIIC2.1 


Find the unitary transformation, in ZJBV language, that transforms the 


“untransformed” action for the massive vector (that which gives the gauge- 
invariant action upon dropping antifields) into the action that has only the 


vector field (and not the scalar) upon dropping antifields. 
Writing the BRST transformations in this second-quantized ZJBV notation will 


allow us to gauge fix interacting theories (found by adding interaction terms to the 
free S') in a gauge-independent way. In particular, it proves the equivalence of the 
manifestly unitary lightcone gauge (which has no ghosts, only physical degres of free- 
dom) to the manifestly Lorentz covariant Fermi-Feynman gauges (where the kinetic 
operator is simply  — m2). Specifically, the Q@ action is already unitarily trans- 
formed to the Fermi-Feynman gauge: Keeping just the ¢ terms, we have for a bosonic 


theory 
Spr = S|y=0 = - f dx de $6" (-1)""'c3(O—m?)6 = - | de 1g? (1) 2(O—m?)¢ 


In other words, the Fermi-Feynman kinetic term is just the sum over all fields (but 


not antifields) of a 0 — m? term (using the OSp(D—1,1|2)-invariant inner product: 
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the (—1)*° is just a sign, and can be absorbed by a field redefinition). This can 
also be seen from the result for the complete Q@ action after dropping the antifield 
terms: For example, for a vector the gauge-fixed (free) action is simply (see subsection 
XIIB3) 

Lpp = —+A°DA, — iC500C 
(The Fermi-Feynman gauge for fermions also gives a 0 — m? kinetic term, but with 
an infinite number of ghosts; this may be useful for supersymmetry.) 


Exercise XITC2.2 
Let’s again consider arbitrary-rank antisymmetric tensors (see exercises 


XITA5.1 and XIIB3.1): 
a Find the Fermi-Feynman actions. 
b Do the same for the massive case. (Note: There are more fields.) 


On the other hand, we saw in subsection XIIB9 that a unitary transformation, 
and evaluation at w = 0, gave the gauge-invariant OSp(1,1|2) action in terms of just 
the physical fields: 


Ao = 5(P|c[{ Ss? 1, OY be Se" 1 gOS OF G4 4) = S' = Saiag 
Saag = | de (36"(K + 42°Qn)8a06 + iv (-1)5"15,,.9% — uT(-1)*"5°°u] 


Soi = Saiagly=0 = fe ao (K + 7Q"Qa)da0¢ 


In the usual gauge-fixing approach, we would start with Sygiag and do the inverse 


transformation to obtain the Fermi-Feynman gauge: 
A=-Ap => Spr = (e~“40S¢iag)|y=0 


The same A can be used in the interacting case, since the effect on the quadratic 
piece of the action will be the same. Thus, to apply the usual ZJBV procedure we 
can either start with Sgiag and apply some A ¥ 0 sufficient to fix the gauge, or we can 
start with S (the one we found from quantum mechanical BRST) and apply some 


equivalent A, or no A at all (for Fermi-Feynman gauge). 


To compare with the lightcone gauge, we start with 
Ste = pe [-16(-1)* Od — i(—1)-1S®- arg) 


which was itself obtained by unitary transformation from S (see subsection XIIB4), 
and make a further unitary transformation, of the form A = f dr $¢AD3¢, that has 
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the effect ~w — w+ AD¢ for A such that all terms in ¢O¢ containing auxiliary fields, 


and thus also pure-gauge fields, are canceled. For this transformed 5S). we then have 
Shadi ist = ex +¢105(S4)¢ 


where the projection operator 6(S4") picks out the singlets of S47 (A = (+,a)), 
i.e., the transverse (physical) degrees of the light cone. A similar procedure can be 


applied in the interacting case. 


For the example of the vector: 


In U = —i5e(|°)(*] + 7) (P/) (07) “0 
Qie = 3c) _ Se- or —? ie diag = 5cO|") (| —_ Soo" 
Lie —_ Dijettiag = —<A,DA; + Get Aq = iA atC 


which has just the transverse (lightcone) degrees of freedom when the antifields are 
dropped: 
Ligoy = —7A,DA; 


3. BRST 


In subsection VIA2, we saw that BRST could be used to gauge fix by adding a 
BRST variation to the gauge-invariant Lagrangian. In that case, physical states are 
those that are not only in the BRST cohomology, but also satisfy the equations of 


motion. 


On the other hand, for relativistic mechanics we saw that the equations of motion 
are rather redundant, since T is unphysical, and so p?(+m?) = 0 is already included 
as a constraint, and contained in the quantum mechanical BRST operator. In fact, 
we have seen how in the most general case of a free field the correct spectrum is 


specified by just the cohomology of the quantum mechanical BRST operator. 


We therefore want to identify the quantum mechanical BRST cohomology condi- 
tion with the combination of the second-quantized BRST cohomology condition and 
the wave equation. This essentially has been accomplished in subsection XIIC2 by 


decomposing Qiag With respect to c, as we'll now see by some further analysis. 


From subsection XIIB9 we have 
Qdiag = —C(K + $2O°O,)550 + (cbQ°5,3,5 + bc5,3,-5O%) + Sb 


where the first term gives L,; in terms of the physical part (s = 0) of ¢, the second 


term gives the minimal BRST transformations in terms of the minimal (anti)fields, 


C. GAUGE FIXING 863 


and the last term adds the nonminimal stuff needed for fixing to general gauges. In 


particular, we see that the BRST transformation of the physical fields are 
(dso) aa 602° = 650 Q® (65,1/2653,-1/20) 


(Note that this differs somewhat from the expression found from Q, since the trans- 
formation to Quiag 1s effectively a redefinition of ~, adding to it a piece proportional 


to an operator on ¢.) 


Thus the only occurrences of the physical fields in Qaiag are in the term that 
gives the gauge-invariant action and the term that gives their BRST transformation; 
the remaining terms introduce nonminimal fields, as well as account for the BRST 
transformations of the ghosts. But this is the definition of BRST: Take the classical 
action in terms of physical fields, construct the BRST transformation from the gauge 
transformation that leaves the classical action invariant, add terms to the BRST 
operator that insure its nilpotency on these ghosts, and add nonminimal terms to 
allow gauge fixing. We have just seen that Quiag is exactly of this structure, where 
gauge fixing gives the desired Fermi-Feynman gauge by unitary transformation to Q 
(which becomes a canonical transformation in second-quantized language, using the 
antibracket). 


All that is left to see is how the ZJBV combination of the gauge-invariant action 
with the BRST operator is equivalent to ordinary BRST. Expanding in antifields, 
ZJBV gives the gauge fixed action as 


where we have used the fact that the three terms in Qiag contain only the physical, 


minimal (“m”; including physical), and nonminimal (“nm”) fields, respectively. In 


the usual ZJBV and BRST formalisms, derived from BRST without antifields, there 


is no w? term, since this generates a BRST transformation 


One instead introduces further nonminimal fields, the “Nakanishi-Lautrup fields”, 
such that 


Qédnm —= ONL 


and use an extended gauge-fixing function 


A — A i OnmPNL 
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Then the gauge fixed action is 


Sop = So + {Q, A} = So + (6A/5Gm)(Qbm) + (5A/5Gnm) One + Ont 


After eliminating the NL fields by their (algebraic) equations of motion, we obtain 
the same result as found from ZJBV. (Of course, the NL fields can also be introduced 
directly into the ZJBV formalism, but are redundant for purposes of finding Fermi- 
Feynman gauges. ) 

Consider the special case of Yang-Mills: Generalizing our results for free Yang- 
Mills to the interacting case, making use of the BRST transformations of subsection 
VIA4, we have 

Lzspv = 3(Fw)? + C2 +iA. [V,C] —COC? 


The basic antibrackets are 
(Aa, Ap) = Nab; (Cc, C) = 0, ic, C) =0 


From the general relations we saw earlier, and the definition of S in terms of Q for 
the free case, we have 

iQ® = (8,8) = (S,9) 
Since in the above we have pulled out factors to the left of the fields, as 


3 ~y 


& = |'\, = I") nj? —ic(—1)* Aj] 


we pull them out of the left of the antibracket, to obtain 
i(Q¢)' = (8,9), i(Q¢)" = (5,4) 


where as before (Q¢)/, etc., means to evaluate the corresponding component of Q@ 
and introduce the corresponding signs for effectively pulling those factors to the left. 
We then find the previous results for the BRST transformations of the fields (by 


construction), but also those of the antifields: 
QA,=-[VaC), QC=i0?, OC=—210 


QA, = -i4[V", Fal +i{C, Ag}, QC =[V.,A] +110,C], QC =0 
(Remember that the funny signs of the antibracket come from its symmetry, plus 
treating the comma in “( , )” as anticommuting. Note the generic terms i[C, }.) 


Exercise XIIC3.1 
Generalize the above results for the action and BRST transformations with 


antifields when Yang-Mills is coupled to matter. 
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BRST was described in a different way in subsection VIA4: Here we apply quan- 
tum mechanical BRST, and find it equivalent to applying the ZJBV form of BRST 
to second-quantization. The ZJBV action consists of the gauge-invariant action, plus 
the antifields times the BRST transformations of the fields, plus (antifield)? terms. 
The difference between the BRST transformations obtained by the general methods 
of subsection VIA1 as applied to second-quantization, and those found in this chap- 
ter by applying OSp methods to first-quantization of relativistic systems, is that the 
Nakanishi-Lautrup field is treated as a field in the former approach and as an an- 
tifield in the latter. The two give equivalent results: The latter uses fewer fields, 
but is slightly more restricted in choices of gauge; however, this restriction is avoided 
in practice. (More “nonminimal” fields can be added to allow more general gauge 
choices in either case.) For example, the ZJBV action for the former treatment of 
Yang-Mills can be obtained from that for the latter by the replacement 


C os CB 
The gauge-fixed action is the canonically transformed action (with respect to the 


antibracket) evaluated at vanishing antifields: 


Sop = Sz spy 


Consider Yang-Mills in the most common type of gauge, where some function of A is 
fixed. From the usual BRST approach (see subsection VIA4), or the ZJBV approach 
with B, we find 


A=tr [ 3C(f(A) + $08 = Lop = Los — 4Bif(A) + 403] -HeF vc 


while in the ZJBV approach without B we have 
Z ~,O 
A=tr f 3C7(A) = Ligg = Lyi + AAP — HOLE “IVC 


which is equivalent to the previous for positive a (after elimination of B). 
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sSadewadiwnearsisouisaies oisde: PTRCP INIA a icxiacavccessseiscrecwoevsenzs 
Conversions to other common conventions 
Nab — —Nabs Ya Fa Vai = — Fis, 2 So 
2 g? 2 d?x g 


“2g via $(m’*-O)-m’?-O, —,;—-4d’2| or nonabelian 
TT 


(27) P/2 167? 
Natural (Planck) units 


=== = = = — = 861.0225762(29), Ht =1.43(7) x 10% 
(a; 


1 kg = 2.59225(20) x 10", 1m =1.096651(82) x 10°, 1s = 3.28768(24) x 10 
1 K = 3.98216(30) x 10°*°, 1 GeV = 4.62109(34) x 10-7 


Indices 

a, b,c, ... — (flat) vector 

i,j,k, ... — transverse (D—1 or D—2) vector or internal 

m, 7, p,... — (curved) vector or large summation 

A, B,C,... — (flat) super or conformal vector 

I, J, K,... — internal 

M,N, P,... — (curved) super 

A, B,C, ... — conformal spinor 

a, 3, -+5 f, V; 7, --- — spinor (usually 2-valued) or fermionic 


l,& — internal 

0 — time 

—1— mass (dimensional reduction) 

+; t,t — lightcone (longitudinal; transverse) 


&,© — spinor or spacecone reference line 


Integration 
_ me dp 
fs fogom [= | ogo 
5(a—a') = (2n)??6?(e—2'),  6(p—p') = (20)?6?(p—') 
(z|x') = d(a —2'), (pip!) = 5(p—p'); (alp) =e'”*, (plz) =e"; pa = 10, 


d(p — p’) 


on — shell : (p| |p’) = amok (p+ m2)] 
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GROUP THEORY (I,X): Covering groups 


§0(2) = U(1), SO(1,1) = GL(1) 

$0(3) = SU(2) = SU*(2) = USp(2), $O(2,1) = SU(1,1) = SL(2) = Sp(2) 

§0(4) = SU(2)@SU(2), $0(3,1) = SL(2,C) = Sp(2,C), $0(2,2) = SL(2)@SL(2) 
$O(5) = USp(4), SO(4,1) = USp(2,2), $0(3,2) = Sp(4) 

$0(6) = SU(4), SO(5,1) = SU*(4), $0(4,2) = SU(2,2), $0(3,3) = SL(4) 


SO*(2) = U(1), SO*(4) = SU(2)@SL(2), SO*(6) = SU(3,1), SO*(8) = SO(6,2) 
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LORENTZ (I,II) 


—m? =s p = DP Nab - =(p°)? | (p')? (p?)? (p*)? _ —2p'p as 2p'p' 


E=p'=—po; pe = q(p? +p"), pi = Za(p" — tp’) 


—ds? = dx? = dx*dz’n», pds = mdx*, pdr = dx 


ptm = Sale pe + Sg-1m + wp, = S_1°D, + wm = 0 


2-spinor 
re Or Ce ad ae ae 
Wa=WCoa, a=VPCp,; 0? = Ada = WY? = -ihevo 
p= (ot > (Walt=—Ba, (BH )t = BP = bytds = fH 
Ata = Aap — Aga = CapC®Ays, Alas =0;  V2=-2det V=VV,, 


CopCai> aa pir.ab = UCapC6Cy5C 54 — CasCayCy3Cs3) 
W=U%Xalr W) =I da Wl= ola, ld] = [he 
V=()VaPlg V¥=-lVFalsl F=l)f%l f= lela 
(bx) = (xd) = ¥%Xe, x) =¥*xe; Wx) = fbx] 
WIVIX] =¥VaPxg, lf) =v faxes, VW*+WV* = (V-W)r 


ie Se SS e(V,W, X,Y) =i tr(VW*XY* —Y*XW*V) 


Neé,63 = 


4-spinor 


Wa 7 a aa a a a 
seat , Pew Te ory) ey agi +s” 


0 Vie 0 Cs ~id8 0 
tee : ) r= Vive= ( oo ) aa a) 
VW ya=-2, Yd%a=4, Yahya=a-b,  y°Abeya = Abd 
tr(I)=4,  tr(d#)=—2a-b, — tr(dbdd) = a-be-d+a-db-c—a-cb-d 
Y= |) ba = ber w = yl = ae ie o*[3| 


Yep =—lalgl—[aMali = He = Mab — = IYI 
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ACTIONS (III-VI, XII) 


L=-lPg(q)+GAq+U@,  S= dt L 
Ly = —qp+ H(q,p), Su = f at H — dq p 
A= [ e Pw; Wick : A= [6 e°W, S>0 
Mechanics 
OSp(1,1]2): 


Ke-7(O=m), OF =5"0,+5° om [yy - 7h) 
S08 = $08 (594), — 5.87 Sa5 = 45(s + 1) 
aa iu dx Ly, Lg =30"Kyd, Ky = 3(-O+m?+30°9,) — ($*°¢ = 0) 
5b = b5059° Ag 
Sp= / dx Logis, Legg = 30K gig, Koig = 31° (Sa*Oa + Sasi) 
IGL(1): 
Q = 3ce(O — m) + $0, + S@ Tim + 8%%b (48°), J =chb+ 8? (+8°) 


c= eu? fi dx de 46" (-1)!1Q6 = GliQS), Sy = Slyo=o 


Ser = Sheno =~ f de $6"(-1)" (0 mo 
Quantum ChromoDynamics 
Gi=Gi, [Gi,Gj])=—ifi*Ge,  (Gib)a = (Gila? oe 
Va = 0a + tAg = 0, + iAa'Gi, —i[Va, Vo] = Fay = Fu'Gi = Oa Ay + i[Aa, As] 
L= btr FR e+ L(V,v), — tro(GiGj) = iy, tra(GiGj) = 2N6,; 

QA, = —[Va,C], QC =iC?, QC = -iB, QB =0, Q¢ = iC (for 5¢ = idd) 
Sof = Si -iQA, A= ur | EC(fttaB) => Lop = Ly —4B(f+4aB)+4hiC(6f)|—c 
Latajorana = PIVa Pat Sg (b Yat oa) > FOO - mM \ha — 7 fal’ bp 

Lpirac = VOY + SEY = (PV aha + KV aXe) + (2X0 + Pr Xa) 
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FEYNMAN (V) 


N 
S=5+5, =f 40K  Av=T] (> Z\¢) 


a=): 0g 
Z|] = A [6 e7 (Sol¢}+Srlot+e]) — exp [ssEx eSilyl 
250 K dy 


Effective action I'[¢] (unrenormalized): (E.g., L = —+¢(O1 — m?)¢ + 29¢°.) 
A1) 1PI graphs only (plus So). (For W[@], connected graphs only.) 

A2) Momenta: label consistently with conservation, with { dp for each loop. 

A3) Propagators: 1/K for each internal line. (E.g., 1/5(p? + m?).) 

AA) Vertices: read off of —S;. (E.g., —g) 

A5) External lines: attach the appropriate (off-shell) fields and f dp, with (3° p). 
AG) Statistics: 1/n! for n-fold symmetry of internal/external lines 


( 
( 
( 
( 
( 
( 


(or keep just 1 of n! related graphs); —1 for fermionic loop; overall —1. 


Vacuum: (Renormalize before for minimal subtraction/after for MOM.) 
(B1) Find the minimum of the effective potential (for scalars). 
(B2) Shift (scalar) fields to perturb about minimum; drop constant in potential. 
(B3) Find resulting masses; find wave function normalizations. 
T-matrix: 
(C1) Connected trees of (shifted, renormalized) I’: (A2-4) for L=0 with S — I. 
(C2) Amputate external [>-propagators. 
(C3) External lines: appropriate to I) wave equation Kw = 0. (E.g., 1.) 
) 


(C4) External-line statistics: No symmetry factors; —1 for fermion permutation. 
Probabilities 
Danineeied = id (~~ P) T 


\P/2 Dal \D/2 
ap = |pl°6° (Sov) TT] SF, p= 2m 12m)? FY] 


all ae out (20)? 


in 


a ee —--2ImM, “~~ =2ImT,=—Im M? 
dt Ww ds m dt 
dP (Qn)? Gap (21) P/? 
Mo = (pi: p2)? — mims = ¥[8 — (m1 + m2) ][s — (7m — m2)*] 
do 1 
= 22m Trl’ (4D); 8 = —(Pi + pe)’, t= —(p1 + Bs)”, w= —(P1 + Pa)” 
12 
do |p3|P 71 Des 
os (eA? Oe 97 A CoM 
d§2 (20) Ty Ar2[3(s — m3 — M3)ws — m3w4] On) "Fal Ay9sP/2-1 oe 
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GAUGES (II,VI): Gervais-Neveu 
La =—4A-DA—iA*A’0,Ag — 4A°A’ AG AD 


ed 


Yang — Mills : L=La+Le, Lo =—MC(d+iA)*C — 10C(0- A+iA?) 
Gervais— Neveu: L=La+4m?A? 
Twistors 
Ga raly p= Pipe blr lah. l= lips 
(pq)* = [ap] = —[pa], (pa) (rs) + (ar) (ps) + (rp) (qs) = 
| | 


PS PPP lee a= 


p* = (p—)[-p], p” = (+0) [p+], v’ = (+p)[-p], B 


Spacecone 


[tat gouges row-nall [ral Ch alony Fel ea) 


(partly) temporal timelike: A° =0 lightcone: At =0, 6/dA7 


spacelike Arnowitt-Fickler : At =0 spacecone: At'=0, 6/5A* 
scalar unitary: ¢= ¢'| Gervais-Neveu: ¢= (¢), 6/d¢' 


n- A= 0, n= |+)(-|; tree ~ ( aaa (ee 


L=L2+1l3+ [4 


_ip? 
Lp = At(-3P*)A- + ae a 


ta = (Za*) (144 pat] + (ute) + (Ze) 0) 


by = ((A* pA] + (UW SA pew I-IA wt] 
lee) ees) feo ss 
Av = = A py yr =[-p], v= (+) 


(+ 
- + - + 
ref. lines : PAP BO Pot a Pim el 
Pp P Pp Pp 
Pe=l-N-l Pe=lt)) Pe=6t, P§=d4 


Background-field 
E> ot+@; Vee D+ iA, Pag = Fag + Dig Aut Aa As 
@-A-D-A, Ca-VC—CD°C+CD.-iA,C] 
1 — loop: K= —3(0 — iF © Sra) 
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SUPERSYMMETRY (II,IV,VI): Superspace 


. O aA _ . 0 a ~ 
da = —1 (sr = 1%»,.9) > Ga=—t & 7 16° ;  @)l=4a 


gd) OL 7a 
ge 30s + 50° Dpa:; (d*)' = —d 


.Py=(ed}=p9, fae=P asd, PP = OP 


een 19% 5, ds = 


Oe Oi On = O02, 2M = (64, 6", 2™) 
da = (da, da, Oa) = Ea Ou; [da,dp} = Tap de 
ie = as = — 16,5}, rest =0 
Super Yang-Mills 
Va=datiAa, [Va, Ve} = Tan’ Vo + iFap 
Vad =0 = 1 Was Va} = Va. Vat = 0; {Va Val = iV V3 
[Va, Vas] = CasWs; VaWe=0, V°Wot vias =0 
[ees V 33) = WO oiap + C5508); fos = oe V (eWay 


Actions 


Lya=-2 ur feos 1W°W. +6 fats VS [ee ge’ b+ [6 f) + he, 


Ly=2 = —gtr ( i d?90 W2 + / d*0 Voe"0) + Ff d*0 GV + ( / dO Got he.) 
— fa Heo +4] fOr ulo + Mey + he 


Lyaa = gtr - “ d’6 W? — / d46 eV ble’ dr + ( / a0 t€'7* b,[65, 0K] + he.)] 


Supergraphs 
(A23) 0’s: one for each vertex, with an [ d‘0. 
(A3’) Propagators: 


= rT). Oe | ame 1 ae 
(VV, 04, O6, 69) : (1. =e Te a a) wea (0 — 6’) 


(A43) Chiral vertex factors: d? on the ¢ end(s) of every chiral propagator, 
d? on the ¢ end(s), but drop any one such factor at a superpotential vertex. 
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LOOPS (VII,VIIT): Gamma function 


lo) aed 1 
EZ): = | dd \7-1e7* = te" I] ee: elm — Lexp =e a (n)(—z)" 
0 n=1 n n=2 
n Co 1 
y= lim (-m n+" 1) = 0.5772156649..,  ¢(z) = ) 
m=1 n=1 
ES) er ae 


REUSE) VEE aOR « py neat 


P(n+1) =n, 5 
2 z 
lim I'(z) & nee (=) , lim P'(az + 0) & V2n(az)? re V2e—-% 
Z—00 Zz e Z00 
: P(2) Ly) 
Be. -| dz 27 '(1-z oS drt (14r)*%= 
(x,y) ; (1 — z) , (1+7) Tea 
Regularization 
I'(a) = if dr 7o-1e-T(p? +m?) /2 
[s(p2+m)J* Jo 


1= f° a5(A-Sn), Ti = dav 


ae, [a et =o 
(gk?)¢ 
' 1 
rr 
| a(k + 3p)*3(k — 5p)? 
= Pate) 
1,,2)D/2-27>(D _1)r(9— 2B 2 Lf 22 — (my. 2] 10 
(sp") eae Oa 2) UG Do iP (p- x)"]} 


n=0 
T T T ba 
(Ek?) [E(k + py?) (Ap2)er8-D2 
Schemes 
MS: A 
MS: I(8)h, T(1—6)h, etc 
_1)Po/2 = 
1) a I'(1 — 2) er 


Running coupling 
Tiage tr f az iF°:(InO—+)Fa; Bi = $cr(—1)*( 


O O 2 2 2 
Ty = tr fae cF ain + Fin (sim) F, + — el/big (g2) 2/85 


4s°— 2); Chip = 2. CH= 2. 
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GRAVITY (IX) 
[Map, Vi = ViaNb]c > 5A" | Mpa, Vi = No Vas [Mae, MM) — 5 My\" 


IN" Mia = EN Mon + 2? M yg 


[Mag; py _ W(aC yy = 5A [Moa py = ry Was [Mag, MM” io 612 Mpy 
Va=Cat wae’ Meo, €a = Ca, Onn [Vas Vol = Lop We a 5 Rav Mac 
ads* = de" da" Grin, Gro nea Nab 


ad fu eh, e=dete™ L=—ZR+L(V,¥), R= Ra 
Rab ~ ata = 2T ap, Rar = Re acd, 0Su = fe e entde Ty 
Methods 


ni ee 


Dye 


) Dea) 2 YVezd 
) 
) 
) 


ds? =ds,+ds; => V=(V1,V2) 

Va(z) > Va(2’) 

ds? =@ds? => -V. = OV, + (V°S) Mas, 
Regt = PRs + BI VV — 55,54(VO)” 


a 
4): 


( 
( 
( 
( 
[Mis, V2] = No2Vi, [Mio, My3| = 122M43 
[Vi, V2 = [ex + W 1, €2 + wo] 

= {[e1, eo] + (e1w2) Mo — (e2w1)Mi} + {w1[Mi, V2] — wel Mo, Vi] — wiwe[Mi, M2]} 

Examples 


= 1,.-lp — 1-mn a, b cd __ 1-mn a, op 
Lg=—qe a eee aeaem En Rpg = ge em Cn Kpgad 


_-1-mnpq, adi p .B B : 
Ste ey, (thapd Os — Tina’ yea) 


gap 
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SUPERGRAVITY (X) 


Va = Bam On + 324°"Myg +4 Q49Miy+iAaY, — [Y, Val =—}Vo 


(Va, Va} = Tan ?Ve + $Rap? My, + $Rap? Mz, + iF apY 
{Va, Va} = BM 45, {Va, Va} = —1V 03 
[Va, —tV gs] = Cy,We — 3(V6B)M g 3, [-iV os, —tV 93] = Cas. fap — h.c. 


Wa = — BVa— GaP V5 + E(V?Ga\M pe + SWoPMyg + WaY + i2W? Moa 
faa = 19G(a'V ayy — 3(ViaB + 13W(a)V a) + Wag ?V — 3(V(aGay’) V3 
— (GV°B+ BB+ SiVW,) Mag — iggl(Via° G3) Ma? +a © 8] 
+ 3Wap” Moy + 7(ViaV' Gay )M py + i5(V (aWey)¥ 
Wapys = AV (aW 45) 
Ga=Ga, VaB=VaWa=VaWopy=0,  V'Gag = VaB — iWa 
VOW apy = ig V(eW) = —14V (6°Gya, V°W, + VWs = 0 
Ectoplasm 
c= fu (—t) ere? ep en em Lasop 


caged — Ga” pacal, Lined is tEnd,veaW L; Liied — en — 3B)L sa hi. 
£ 


Vi£=0 = (V7 + B)L 


Action 
SsGeH 3 / dx d'0 E'6@ 


Lg = Le + Ly + e'L, 
Lg =—je'R, Ly =e mag len®®, Vr}Vqa, La = —3 (Ga)? + 3BB 
Wmbe = Baty - See _ Tatoe))]; iG oa Ean "Ga + ita Dr” 
SCX = —i(€ + Ee“), 6Um” = Vinee + im? (€sG° — €3038) 
B= —26%tag”, ee. —€? (tea + $C Bats 5”) + h.c. 


LS =," 6h ViniUn)” + i(WapG” s = 43988 =e b) = Costes” os Casta” 
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STRINGS (XI) 


Z -3/¢ Oo G9? On X®) + (OnX? ran 


(open) 


(s) jon aoth . 
a(s)=Kas+-, k= 
ki + (closed) 


=F |= 
as fla(s) + gJla(s) +5 - I ---[a(s) +I] 1 
= { ji | 


S——0Oo 


t fixed 


= [ in g°I'[—a(t)][—a(s)]° 


t 
~ [ lim enter es): cos) et TOR. 
8——00 S 


0 fixed 


Conformal field theory 
L=Xa-wyVXw) > T= xX990x + (3 — w)A(Xx) 


L = 4n"(Vdi) -(Voj) + 5Ru'd: => T = bnij(0¢')(0¢") — 1;07¢" 
1 
(Olx(z)X(2')|0) = 7 


z 
(0|d'(z)G?(2')|0) = —n'In(z — 2) 
eit (2) pta-9(2') = (z _ Pa kel a eae aa ae) es 


277% 


: / © \e)T(), rw) = A(2)(AX(w)) (2) + W(AA)(2)X(w(2) 


: / = 2) (e"), 6) = A(2)(06')(2) + HOA (2) 


271 
w (ce?) = 5a? +ip-a 


anomaly ~ 6(w—4)?-3 


Bnd.state . VC1,VIIB3,5,C2, VITIB4,XIA1-2 
Bracket ...........0..0 eee ee JTA1-2,XITA1 
BRST....... VIA-B, VITA1,IXB1,XIB7,XI1 


Cabibbo-Kobayashi-Maskawa matrix IVB3 


Cartan metric ....... 0.0.0. c eee eee IB2 
Cartan subalgebra................05. IB5 
Casimir operator. ............00 ee eee IB2 
Causality .............000. VA4,C6,VIA1 
Center-of-mass frame....... TA4,VC7,VIC4 
Central charge................. IVC7,XC5 
Channels..............000005 VIIB7,XIA1 
Chan-Paton factors.............. XIA4,C7 


C. conj. IA5,1IIC5,1TA4,IVA4,B1,XC1,XIA4 
Chern-Simons form IIIC6,IVC5, VIIIB2,XC5 


Topics are listed by subsection (or section, etc.). 
Abelian ..... 2.00000 ccc cc ceeeee eee IB2 
ACHOM bs c3.4.d0 Soda dod deeded ic ITI-IV 
Adjoint representation........... IB2,5,C2 
Advanced propagator........... VA3,B1-2 
Affine parametrization.............. IIB2 
Affine transformation ................ IB1 
Analytic S-matrix theory............ XIA1 
Anni’l’n op.IC1,1B3,C5,N1A2,VIA1,3,XIB2 
Anomaly........ VITTIA7,B2-4,C1,XIB4,C7 
Antibracket ......... 0.0.00. c eee ee XIIC1 
Anticommutation................000. TA2 
Anticommuting number/variable...... IA2 
Antifield ..................4. VIA3, XIIB3 
Anti-Gervais-Neveu gauge......... VIB4-5 
Antighost .......... 0.02.0 eee ee eee VIA2 
Antiparticle................ TA4-5,HIB4-5 
Antisym.tensor ITB2, VIIA7,XA3,XIIA5,B3 
Arnowitt-Fickler gauge ............. IIC2 
Asymptotic freedom..... VITIA3-4,C,XIA7 
Atiyah-Drinfel’d-Hitchin-Manin...... ITIC7 
Auxiliary variable... ITIA1,5,B2,C2,4,1VC2 
Axial anomaly............ VITA7,B2-4,C1 
Axial current........... IVA4, VITIA7,B2-4 
Axial gauge........... IIC2, VIB6-8,IXB3 
Axial vector... 0.0... ccc cece cee es TA5 
Axion......... VIIIC4,XIA4-7,C6-7,XIIB8 


Backer’nd fieldVC1,3, VIIB2,XIA5-6,XIIB7 
Background-field gauge ... VIB8,10,VIIIA3 


Ba Pinched aceite wee gee ag ead IVB1 
Baryon number ................. IVA4,B1 
Baryon 23623 oped pened ee bineekaels IC4 
Bhabha scattering................-. VIC4 
Bianchi id. . WA7,IIC1,IVC3,IXA2,XB2,C5 
Big Bang..............000-0-- IVA7,IXC3 
Birkhoff’s theorem................ IXC5,7 
Bjorken scaling variable........... VIIC3 
Black hole... 0.2.2... ...2 0.0 eee eee IXC7 
Bohr-Sommerfeld quantization ....... VA2 
Borel summation/transform........ VIIC3 
Bosonization.......... VITB5,XIA5-6,B7-8 


Bnd.cond. IIIC2,6-8,VA,C3, VIIIC5,XIA3,B1 


Chirality esci02.ci44085 eden IA7,IVC1 
Chiral projector................... ITA6-7 
Chiral representation.............. IVC4-5 
Chiral spinor.................. ITA7,11C4 
Chiral superfield........ TIC5,IVC1-2, VIB7 
Chiral superspace.................. IVCl 
Chiral symmetry......... IVA4, VIIIB2-4,7 
Chiral theory.................002005 IIA7 
Chromodynamics......... IC4,IVB1,VIIC4 
Classical group.............2.005 IB4,11C3 
Classical limit ............ TA1-2, IITA2-3,V 
Clebsch-Gordan- Wigner coefficient IB5,ITA4 
Closed string ............. IVB1,VIIC4,XI 
Coherent state........... 00... cee eee TA2 
Cohomology ............. VIA1,3,XB2,XII 
Coleman-Weinberg mechanism...... VITB2 
Colinear divergence/particle........ VITAG6 
GOlOt ie hog is ees iad eae IC4,IVA4 
Color decomposition/expansion VC9,VIIC4 
Color ordering............... VC9,VIC1-3 
Compact group ..........0. cee eee IB2,4 
Compactification .............. IIB4,XIA4 
Compensator............ IVA5, IXA7,XA3 
Complex projective........... IVA2, VIIB3 
Complex representation............. IB1-2 
Component approach................ XB4 


Component transformation...... IIC2,XB3 
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Compton scattering ................ VIC4 
Confinement................ IVB1,VIIIB7 
Conformal ............. TA6,IIB-C,ITIC5-7 
Conformal anomaly.......... VITIC1,XIB4 
Conformal boost............. TA6,IIB1,6-7 
Conformal field theory......... XIB,XIIB8 
Conformal gauge...............000- XIB1 
Conformally flat.................0.. IXC2 
Connected graph.................045 VC2 
Conservation ............. TA1,IIIB4,IXA6 
Constituent quark ...............0.0. IC4 
Constrainitie: « casuadenceus Oolave eA es ITTA5 
Constructive quantum field th. WIC4,VIIC3 
Continuum limit........ VA1,VIUIB7,XIA7 
Contour integral ................00.. IIA1 
Contracted indices.................4. TAl 
Contraction ....... 0.00. cece ee eee IIC3 
Coordinate representation ............ IC1 
Coordinate transform. . [A,C2,IIIB1,3,IXA1 
Coset Space... 6. eee cece nee eee IVA3 
Cosmological red shift......... IVA7,IXC4 
Cosmological term..... IXA5-7,C2,XB1,6-7 
Cosmology ..........0+000e eee IVA7,IXC3 
Counterterm....... 00.000. e eee VITAL 
Covar.deriv. IIC2,IIJA4,C1,IVC3,IXA2,XA1 
Covariant momentum........... TA5,IIC1 
Covering group... 1.2.00... eee ee eee IC5 
CPG) MmOdel ys atre tere huw ey IVA2, VIIB3 
CPT theorem ...............4. IVB1,VC8 
Creat’n op. 1C1,IIB3,C5,NTA2,VIA1,3,XIB2 
Critical dimension............... XIA7,B4 
Crossing symmetry ............ VC8,VIC4 
Cross product ...... 0.0.00 c eee eee IIA1 
Cross section...........000 eee VC7,VIC4 
Curreniti..cni pes ene Rests IIA7,I1A4,B4,VC3 
Current quark................00 2000 IC4 
Curvature'ad a. ide sco eelntale de neeelens IXA2 
Curved indices...............002005 IXAl 
Cut propagator ........ 00... cee eee VC7 
Cutting rule....... 0... ......02 2008. VC6 
Decay rate........ VC7,VIIA4, VIIIA3,B4 
Deceleration parameter........ IVA7,IXC3 
De Donder gauge................04- IXB1 


Deep inelastic scattering........... VITIC3 


Defining representation............... IB4 
Degree of freedom ... IVC3-4,VB3,XC3,XII 
Density............2..-. ITIB1,IXA4,XTA6 
Density parameter ............ IVA7, IXC3 
De Sitter space..............0 005. IXC2-3 
Determinant ............. IB3,C2,5,1TA1,5 
Differential cross section............. VC7 
Differential form........... IC2,IXA4,XB2 
Dilatation .................0.. TA6,IIB1,6 


Dilaton .... IVA7,IXA7,XIA4-6,C4-7,XIIB8 
Dim.red IIB4,IIC8, VIB6, VITA1, VIIIA3,XC 


Dimensional regularization............ Vil 
Dimensional transmutation......... VITB3 
Dirac delta. ..... 00.00.00. ce eae IIA1,VA2 
Dirac equation ................ TIB2,1T1A4 
Dirac matrices/spinor....... 1C1,ITA6,XC1 
Diréct: product /suttie<: cadiervepeeheest IB2 
Discrete symmetry..............02005 TA5 
Divergence. nieve seta ee Gee Peug sede Vil 
Division algebra ................204. IIc4 
Double covering. ..............2-0005 ITA3 
Drell-Yan scattering .............. VIIC3 
Duality . IIA7,I1C4,VIIB5,X A3,B5-6,C6,XI 
Dual model.............. 0022-000. XIA1 
Dual representation/space............ IB1 
Duffin-Kemmer matrices............. TIB4 
DuUStsc.3 he avai es Gnd he ndtlete weed IXB2,C3 
Dynkin index.............00...0 2000 IB2 
Ectoplasm sible Ba sae Naar aclesh co anata ate ek XB2 
Effective action ..............002-04- VC2 
Effective potential................. VITB2 
Einstein-Hilbert action.............. IXA5 
Einstein summation.................. TAl 
Elastic scattering ................... VC7 
Electromagnetism IA5,C4,ITA7,B5,B7,IVB2 
Energy...........2-006- TA1,5,1JA5,B6,C1 
Energy-momentum tensor . . IIIA4,B4,IXA6 
Equivalence principle............... IXA4 
Euclidean space........... TA4,11IC4,VB4 
Euler number.......... VIIC4,IXA7,XIA2 
Euler-Mascheroni constant ......... VITA2 
Euler’s theorem ................0.. VIIC4 
Event horizon................02-005 IXC7 
Evolution equations............... VITIC3 


Exceptional group............... IB4,XC6 
Exclusive scattering............... VIIC3 
Explicit supersymmetry breaking .... IVC6 
Explicit symmetry breaking .......... IVA 
Extended supersymmetry..... IVC7,XC3-6 
External field.............. IITA1,B3-5,VC 
External line..................002005 VC2 
Extrinsic length.................... ITIB1 
Fee ten eccaia xt ede VIIC4,XIA7 
Factorization. ................005- VIIC3 
Family? yc...c:cge ana rirk kel hecote IC4,IVB3 
Fayet-Iliopoulos term ............. IVC5-7 
Fermi-Feynman gauge. . VIB2-3,9, [IXB1,XI1 
P6FMION aso have ener ed ne eine de owes TA2 
Fermion doubling................. VITIB7 
Feynman diagram/graph............. VC2 
Feynman parameter ............... VITA2 
Feynman tree theorem............... VC6 
Field strength ....... ITA7,B3,5,01C1,xXA2 
Fine (Ung: bie ieee bo4 Ole eels oes XB6 
Finite theory ................... VITIA5-6 
Finite transformation ................ IA3 


1st-order HIA5,C4,IVC5,IXA5,XB4-5,XIA3 
1st-quantization VB1,VIIB1,C5,XIB-C,XII 


Plat: imidi¢es:: even hee ioe cei ees es IXAl 
Flat superspace ..............2.-04- IVC3 
Flavor sscgeeharesd hee raer ses IC4,IVA-B 
Flavor-changing neutral currents..... IVB3 
Foldy-Wouthuysen transform. . IIB5,VITIB6 
Fotit-V6Ct0r iano sonnei Menetas bier TA4 
Fragmentation function ........... VITIC3 
Functional. ........... 000.0000 IC1IIA1 
Functional integral. ................. VAI 
Functional derivative ............... IA1 
Fundamental representation........... IB4 
Furry’s theorem............ VITA5, VITIB3 
Galilean boost PROUDA 0onee Gain ie IAl 
Gamma matrix............. IC1,IIA6,XC1 
Gauge....... IIB2,C1, VIB, IXB,XA4,XIB1 
Gauge field...............0..004- IIA7, III 
Gauge invar. . ITA7,I1A5,B1,C1,IXA1,XIA3 
Gauge parameter ..............4. ITA7, III 


Gal PiNON seiko ke eels eh aes 4 IVC5 
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Gauss-Bonnet theorem.............. IXA7 
Gaussian . . IB3,IIC3,VA2,5,B1,VIIA2,XIA7 
General linear group ................. IB4 
General relativity .................004. IX 
General superfield.................. IVCl 
Generating functional.............. VC1-2 
Generator i3.06008 eased (ee ouaierhs IAl 
Genus Sheeting ces es Piles tes Bk XIA2 
GeodeSiGiaite eli ell oo eee IXB2-4 
Gervais-Neveu gauge.............. VIB4-5 
Gervais-Neveu model ............... IVA6 
GhOSt xs os -Hiesde nes ace oa ee VIA, XII 
Ghost number............... VIA1,XIIB1 
Glashow-Iliopoulos-Maiani mechanismIVB3 
Glashow-Salam-Weinberg model..... IVB2 
GLDAP equations................ VITIC3 
Gliozzi-Scherk-Olive projection ... XIB5,C6 
Global symmetry/transformation ........ I 
Glueball.......... IC4,IVB1,VIC4,VIHIB4 
GUI O 2x eye ieee a olds Bees ieee Satan IVC5 
Gluon............ IC4,IIIC,IVB1,VIC, VII 
Goldstone boson/theorem............ IVA 
Goldstone fermion............. IVC6,XB6 
Graded commutation................. TA2 
Graded group/symmetry..... TA2,11C3, XII 
Grand Unified Theory....... IVB4,VHIA4 
Graphs2..aiveas.nve reed enone ped bote VC2 
Gravitational collapse/radius........ IXC7 
Gravitational red shift.............. IXC6 
Graviton . IIIA4,C1,IVA1, IXB1,C1-2,XIA,C 
GraVvitysn vec visnoredeowebastovaw: TTA4,1X 
Green function ............... VA-B,XIC3 
Green-Schwarz superstring ..... XIB5,C6-7 
Gribov ambiguity .................. VIB2 
GrOlUp:.omde ratio uh Yes eine eave res TA3-II 
Group contraction ............0..06. IIC3 
Group metric............ 0. cee eee eee IB4 
G-scheme........ 0. cee cece eee ne VITA3 


Hagedorn temperature.............. XIC2 
Hamiltonian .......... TA1,IIIA,VA1,VIA1 
Hamiltonian density................ IA3 
Hamilton-Jacobi equations........... VA2 
Hamilton’s equations................. TA1 
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Handle’. 2282.0 sr) sateen Pate VITIC4,XTA2 
Hard particle.................00.. VIIC3 
Hardy-Ramanujan formula.......... XIC2 
Harmonic gauge.................0-5 IXB1 
Helicitys. 22c0 eee IIB7,C5,11IC2 
Heterotic string......... XIA4-6,C6,XIIB8 
Hidden matter.................00005 XB7 
Higgsino...... eee eee ee eee IVC5 
Higgs mechanism................-.. IVA6 
Higgs scalar................00.. IC4,IVA6 
Higher dimensions.................... XC 
Hilbert-space metric....... IB4,VC5,XIA1 
Holees) ee snie eerie wer ar bineale tes < VIIC4 
Homogeneous space.......... IVA7,IXC3-4 
Hubble constant .............. IVA7,IXC3 
Hypermultiplet /hypersymmetry ..... IVC7 
Ldcal MIPMING ole trace Aid cee ees Be IC4 
i€ prescription ................04. VA3,B1 
1G i Gi een eee eas aie er eee Bee Eee XIIB-C 
Improved perturbation............. VIIC1 
Inclusive scattering ............... VIIC3 
Index notation ................. IB, TIA4-5 
Induced metric...............02-005 XIA3 
Inelastic scattering .................. VC7 
Inhomogeneous group...........--- XIIB1 
Infinitesimal transformation........... TA1 
Tnfinity ce tise eget eigen ete cn Ges VII 
Infrared divergence.......... VITA6,VITIB 
Infrared slavery ..............002005 IVB1 
Inhomogeneous transformation..... TA4,B1 
Inner product ...............-. IB1,XIIB2 
Instanton...............005 IIC6-7, VIIC2 
Interaction picture .................. VA4 
Internal line........... 0.00.0... 00 VC2 
Intrinsic length..................... IIB1 
Inversion... 0.2.0... eee eee eee eee TA6 
Irreducible representation............. IB2 
IsOSpil 3.22. aat eee eda eat IVB2 
Isotropic space.............. IVA7,IXC3-4 
Jacobi identity 0.000.000.0000. 1A1,3 
Jacobi theta function ............... XIC2 
JObiiiie Sine ween nye ne hanes 3 VIUIC2-3 


Kahler manifold /potential .......... XB6 
Killing equation/vector....TXA2,6,B2,C4,6 
Kinoshita-Lee-Nauenberg .......... VITAG 
Klein factor/transformation . [A2,VC5,XC1 
Klein-Gordon equation .............. IIB1 
Kronecker delta. .............02.00005 TAl 
Kruskal-Szekeres coordinates ........ IXC7 
Latte grec s tei Sees VC7 
Ladder diagram.................04- XIA1 
Lagrange multiplier............... IMA1,5 
Lagrangian ................ 00.02 eee TA1 
Lagrangian density................. IA3 
Lamb ‘shift ¢..40-:0+ anther sees wees VITIB6 
Landau equations/singularity ........ VC8 
Landau gauge................005- VIB2-3 
Landau ghost ..............2.0000- VIIC2 
Laplace transform...............4. VIIC3 
Lattice sc scncd ear iton Gunes eats VIUIB7,XIA7 
Legendre transform............. VC3,XB5 
Length............. TA4, TTB1,IXB2,X1A3 
Lipton: fawisb waits aren wae eed IC4,IVB 
Levi-Civita tensor ................ TA5,B3 
Lie algebra/bracket /derivative/group . . 1A3 
Light bending...................04. IXC6 
Lightcone rst ts fie sired ew eneraaaes TA6 
Lightcone basis...............02-000- TA4 
Lightcone coordinates ........ TA4,6,XT1A1 
Lightcone frame................... IIB3,6 
Lightcone gauge............ ITB3,H1B2,C2 
Lightcone spinor .................04- XC4 
Linear multiplet .................... XA3 
Litt ecto. po we tae oe ac VC2,8, VIIIB7,XIB7 
Little group............2..02--00 0 TIB3-4 
Local inertial frame................. IXC4 
Localitys.g2 vii cian eee ek ees IIA1,3 
Local symmetry/transformation........ III 
Loop graph .... HIB5,VC,VII-VII,XIA2,C 
Lorentz connection ................. IXA2 
Lorentz force law..............2.005 IXB2 
Lorentz transformation ........ TA4-5,ITA5 
Lorenz gauge................. VIB2,IXB1 
Lowering operator........... TA6,B5,VIA1 
Maenctic charge ...........0..008- IIC8 


Magnetic moment..... W1C4,VIA3, VIIB5 
Magnetic monopole................. ICs 
Majorana spinor.......... ITAG,TIA4,XC2 
Mandelstam variables...... TA4,VC4,VIC4 
Manifest symmetry ................-. IB1 
Maximal helicity violation......... VIC1,3 
Maxwell’s equations........ IIA7,B2,01A4 
Mechanics................ TA, IIIB, VB,XII 
Mellin transform .................. VIIC3 
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